Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 1287 |
| Missing cells | 1 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 241.4 KiB |
| Average record size in memory | 192.1 B |
Variable types
| Numeric | 12 |
|---|---|
| Categorical | 12 |
Alerts
| Alert | Type |
|---|---|
| imdb_id has a high cardinality: 1287 distinct values | High cardinality |
| original_title has a high cardinality: 1280 distinct values | High cardinality |
| cast has a high cardinality: 1278 distinct values | High cardinality |
| homepage has a high cardinality: 1266 distinct values | High cardinality |
| director has a high cardinality: 789 distinct values | High cardinality |
| tagline has a high cardinality: 1283 distinct values | High cardinality |
| keywords has a high cardinality: 1272 distinct values | High cardinality |
| overview has a high cardinality: 1287 distinct values | High cardinality |
| genres has a high cardinality: 496 distinct values | High cardinality |
| production_companies has a high cardinality: 1138 distinct values | High cardinality |
| release_date has a high cardinality: 1080 distinct values | High cardinality |
| Unnamed: 0 is highly overall correlated with id and 1 other fields | High correlation |
| id is highly overall correlated with Unnamed: 0 and 1 other fields | High correlation |
| popularity is highly overall correlated with budget and 5 other fields | High correlation |
| budget is highly overall correlated with popularity and 4 other fields | High correlation |
| revenue is highly overall correlated with popularity and 5 other fields | High correlation |
| vote_count is highly overall correlated with popularity and 5 other fields | High correlation |
| release_year is highly overall correlated with Unnamed: 0 and 1 other fields | High correlation |
| budget_adj is highly overall correlated with popularity and 5 other fields | High correlation |
| revenue_adj is highly overall correlated with popularity and 5 other fields | High correlation |
| profit is highly overall correlated with popularity and 4 other fields | High correlation |
| imdb_id is uniformly distributed | Uniform |
| original_title is uniformly distributed | Uniform |
| cast is uniformly distributed | Uniform |
| homepage is uniformly distributed | Uniform |
| tagline is uniformly distributed | Uniform |
| keywords is uniformly distributed | Uniform |
| overview is uniformly distributed | Uniform |
| production_companies is uniformly distributed | Uniform |
| release_date is uniformly distributed | Uniform |
| popularity_level is uniformly distributed | Uniform |
| Unnamed: 0 has unique values | Unique |
| id has unique values | Unique |
| imdb_id has unique values | Unique |
| overview has unique values | Unique |
| revenue_adj has unique values | Unique |
Reproduction
| Analysis started | 2023-01-31 16:17:18.843366 |
|---|---|
| Analysis finished | 2023-01-31 16:17:36.597991 |
| Duration | 17.75 seconds |
| Software version | pandas-profiling v3.6.6 |
| Download configuration | config.json |
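A report of this kind could be regenerated roughly as follows. This is a minimal sketch, not the original run: the helper name `build_report`, the file names, and the report title are hypothetical, and default settings are assumed because the `config.json` referenced above is not reproduced here. It assumes pandas and pandas-profiling v3.x are installed.

```python
def build_report(csv_path, html_path="report.html"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so the sketch can be loaded without the dependencies.
    import pandas as pd
    from pandas_profiling import ProfileReport

    df = pd.read_csv(csv_path)  # csv_path is an assumed input file
    profile = ProfileReport(df, title="Dataset profile")  # assumed title
    profile.to_file(html_path)
    return html_path
```

A column named `Unnamed: 0` typically appears when a CSV was saved together with its index; passing `index_col=0` to `pd.read_csv` would likely keep it out of the profile.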
Unnamed: 0
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 1287 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4125.843 |
| Minimum | 0 |
| Maximum | 10760 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 94.6 |
| Q1 | 1972 |
| median | 3523 |
| Q3 | 6554.5 |
| 95-th percentile | 8706.1 |
| Maximum | 10760 |
| Range | 10760 |
| Interquartile range (IQR) | 4582.5 |
Descriptive statistics
| Standard deviation | 2671.9366 |
|---|---|
| Coefficient of variation (CV) | 0.64760984 |
| Kurtosis | -0.8209793 |
| Mean | 4125.843 |
| Median Absolute Deviation (MAD) | 1993 |
| Skewness | 0.3638638 |
| Sum | 5309960 |
| Variance | 7139245.1 |
| Monotonicity | Strictly increasing |
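Several of the figures reported for `Unnamed: 0` are tied together by standard identities, so they can be cross-checked from the tables alone. The sketch below uses only numbers copied from this report; the variable names are illustrative, and small differences are expected because the report rounds its output.

```python
# Values copied from the tables for "Unnamed: 0".
n = 1287            # number of observations
mean = 4125.843
std = 2671.9366
q1, q3 = 1972.0, 6554.5

cv = std / mean          # coefficient of variation
iqr = q3 - q1            # interquartile range
variance = std ** 2      # variance is the squared standard deviation
total = mean * n         # sum is the mean times the row count

print(f"CV={cv:.8f} IQR={iqr} variance={variance:.1f} sum={total:.0f}")
```

The results agree with the reported coefficient of variation (0.64760984), IQR (4582.5), variance (7139245.1) and sum (5309960) up to rounding.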
| Value | Count | Frequency (%) |
| 0 | 1 | 0.1% |
| 4715 | 1 | 0.1% |
| 5449 | 1 | 0.1% |
| 5447 | 1 | 0.1% |
| 5445 | 1 | 0.1% |
| 5442 | 1 | 0.1% |
| 5438 | 1 | 0.1% |
| 5437 | 1 | 0.1% |
| 5434 | 1 | 0.1% |
| 5433 | 1 | 0.1% |
| Other values (1277) | 1277 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 10760 | 1 | |
| 10759 | 1 | |
| 10724 | 1 | |
| 10689 | 1 | |
| 10595 | 1 | |
| 10594 | 1 | |
| 10489 | 1 | |
| 10438 | 1 | |
| 10401 | 1 | |
| 10338 | 1 |
id
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 1287 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52557.491 |
| Minimum | 11 |
| Maximum | 333348 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 348.6 |
| Q1 | 5851.5 |
| median | 20178 |
| Q3 | 62209.5 |
| 95-th percentile | 249483 |
| Maximum | 333348 |
| Range | 333337 |
| Interquartile range (IQR) | 56358 |
Descriptive statistics
| Standard deviation | 74450.077 |
|---|---|
| Coefficient of variation (CV) | 1.4165455 |
| Kurtosis | 3.3461861 |
| Mean | 52557.491 |
| Median Absolute Deviation (MAD) | 19466 |
| Skewness | 2.0238079 |
| Sum | 67641491 |
| Variance | 5.542814 × 10⁹ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 135397 | 1 | 0.1% |
| 71679 | 1 | 0.1% |
| 136400 | 1 | 0.1% |
| 80274 | 1 | 0.1% |
| 72190 | 1 | 0.1% |
| 47964 | 1 | 0.1% |
| 138843 | 1 | 0.1% |
| 87421 | 1 | 0.1% |
| 93456 | 1 | 0.1% |
| 152601 | 1 | 0.1% |
| Other values (1277) | 1277 |
| Value | Count | Frequency (%) |
| 11 | 1 | |
| 12 | 1 | |
| 14 | 1 | |
| 22 | 1 | |
| 24 | 1 | |
| 28 | 1 | |
| 35 | 1 | |
| 38 | 1 | |
| 58 | 1 | |
| 65 | 1 |
| Value | Count | Frequency (%) |
| 333348 | 1 | |
| 328589 | 1 | |
| 328425 | 1 | |
| 325348 | 1 | |
| 323272 | 1 | |
| 321741 | 1 | |
| 320588 | 1 | |
| 318846 | 1 | |
| 314365 | 1 | |
| 312221 | 1 |
imdb_id
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 1287 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
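The HIGH CARDINALITY and UNIQUE flags on this variable follow from its distinct count. The sketch below shows one way such flags could be derived; the function name `describe_categorical` and the threshold of 50 are assumptions made for illustration and are not taken from this report, and the identifiers are synthetic stand-ins rather than the real data.

```python
from collections import Counter

def describe_categorical(values, cardinality_threshold=50):
    """Summarise a categorical column (hypothetical helper, assumed threshold)."""
    counts = Counter(values)
    n_distinct = len(counts)
    return {
        "distinct": n_distinct,
        "high_cardinality": n_distinct > cardinality_threshold,
        "unique": n_distinct == len(values),  # every value occurs exactly once
    }

# Synthetic identifiers shaped like imdb_id, one per observation.
ids = [f"tt{i:07d}" for i in range(1287)]
summary = describe_categorical(ids)
small = describe_categorical(["a", "b", "a"])
print(summary, small)
```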
| tt0369610 | 1 |
|---|---|
| tt1855325 | 1 |
| tt1272878 | 1 |
| tt1731141 | 1 |
| tt0816711 | 1 |
| Other values (1282) |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 11583 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1287 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | tt0369610 |
|---|---|
| 2nd row | tt1392190 |
| 3rd row | tt2908446 |
| 4th row | tt2488496 |
| 5th row | tt2820852 |
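The character statistics for `imdb_id` can be illustrated with the five sample rows. This is a minimal sketch using Python's standard `unicodedata` module, in which `Ll` and `Nd` are the general-category codes for Lowercase Letter and Decimal Number; the variable names are illustrative.

```python
import unicodedata
from collections import Counter

# The five sample rows listed above.
sample_ids = ["tt0369610", "tt1392190", "tt2908446", "tt2488496", "tt2820852"]

# Count characters by Unicode general category.
category_counts = Counter(
    unicodedata.category(ch) for value in sample_ids for ch in value
)
lengths = {len(value) for value in sample_ids}

# Every value has 9 characters, so 1287 rows give the reported total.
total_characters = 9 * 1287

print(category_counts, lengths, total_characters)
```

Each identifier contributes two letters and seven digits, which is consistent with the reported totals of 2574 lowercase letters and 9009 decimal digits across 1287 rows.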
Common Values
| Value | Count | Frequency (%) |
| tt0369610 | 1 | 0.1% |
| tt1855325 | 1 | 0.1% |
| tt1272878 | 1 | 0.1% |
| tt1731141 | 1 | 0.1% |
| tt0816711 | 1 | 0.1% |
| tt1606378 | 1 | 0.1% |
| tt1457767 | 1 | 0.1% |
| tt1411250 | 1 | 0.1% |
| tt1690953 | 1 | 0.1% |
| tt1798709 | 1 | 0.1% |
| Other values (1277) | 1277 |
Length
| Value | Count | Frequency (%) |
| tt0369610 | 1 | 0.1% |
| tt1013743 | 1 | 0.1% |
| tt2908446 | 1 | 0.1% |
| tt2488496 | 1 | 0.1% |
| tt2820852 | 1 | 0.1% |
| tt1663202 | 1 | 0.1% |
| tt1340138 | 1 | 0.1% |
| tt3659388 | 1 | 0.1% |
| tt2293640 | 1 | 0.1% |
| tt1964418 | 1 | 0.1% |
| Other values (1277) | 1277 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 2574 | 22.2% |
| 0 | 1475 | 12.7% |
| 1 | 1316 | 11.4% |
| 2 | 904 | 7.8% |
| 4 | 883 | 7.6% |
| 3 | 830 | 7.2% |
| 8 | 776 | 6.7% |
| 7 | 727 | 6.3% |
| 6 | 710 | 6.1% |
| 9 | 703 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9009 | 77.8% |
| Lowercase Letter | 2574 | 22.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1475 | 16.4% |
| 1 | 1316 | 14.6% |
| 2 | 904 | 10.0% |
| 4 | 883 | 9.8% |
| 3 | 830 | 9.2% |
| 8 | 776 | 8.6% |
| 7 | 727 | 8.1% |
| 6 | 710 | 7.9% |
| 9 | 703 | 7.8% |
| 5 | 685 | 7.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2574 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9009 | 77.8% |
| Latin | 2574 | 22.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1475 | 16.4% |
| 1 | 1316 | 14.6% |
| 2 | 904 | 10.0% |
| 4 | 883 | 9.8% |
| 3 | 830 | 9.2% |
| 8 | 776 | 8.6% |
| 7 | 727 | 8.1% |
| 6 | 710 | 7.9% |
| 9 | 703 | 7.8% |
| 5 | 685 | 7.6% |
Latin
| Value | Count | Frequency (%) |
| t | 2574 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11583 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 2574 | 22.2% |
| 0 | 1475 | 12.7% |
| 1 | 1316 | 11.4% |
| 2 | 904 | 7.8% |
| 4 | 883 | 7.6% |
| 3 | 830 | 7.2% |
| 8 | 776 | 6.7% |
| 7 | 727 | 6.3% |
| 6 | 710 | 6.1% |
| 9 | 703 | 6.1% |
popularity
Real number (ℝ)
| Distinct | 1286 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7860221 |
| Minimum | 0.010335 |
| Maximum | 32.985763 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | 0.010335 |
|---|---|
| 5-th percentile | 0.2859329 |
| Q1 | 0.6647835 |
| median | 1.152354 |
| Q3 | 2.1253415 |
| 95-th percentile | 5.4437444 |
| Maximum | 32.985763 |
| Range | 32.975428 |
| Interquartile range (IQR) | 1.460558 |
Descriptive statistics
| Standard deviation | 2.172137 |
|---|---|
| Coefficient of variation (CV) | 1.2161871 |
| Kurtosis | 63.240427 |
| Mean | 1.7860221 |
| Median Absolute Deviation (MAD) | 0.608696 |
| Skewness | 6.0176957 |
| Sum | 2298.6104 |
| Variance | 4.7181792 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.107689 | 2 | 0.2% |
| 32.985763 | 1 | 0.1% |
| 2.032167 | 1 | 0.1% |
| 2.196946 | 1 | 0.1% |
| 2.476989 | 1 | 0.1% |
| 2.604638 | 1 | 0.1% |
| 2.815499 | 1 | 0.1% |
| 3.472358 | 1 | 0.1% |
| 3.518275 | 1 | 0.1% |
| 3.928789 | 1 | 0.1% |
| Other values (1276) | 1276 |
| Value | Count | Frequency (%) |
| 0.010335 | 1 | |
| 0.015997 | 1 | |
| 0.021371 | 1 | |
| 0.028456 | 1 | |
| 0.040858 | 1 | |
| 0.050524 | 1 | |
| 0.06324 | 1 | |
| 0.075624 | 1 | |
| 0.076109 | 1 | |
| 0.086287 | 1 |
| Value | Count | Frequency (%) |
| 32.985763 | 1 | |
| 28.419936 | 1 | |
| 24.949134 | 1 | |
| 14.311205 | 1 | |
| 13.112507 | 1 | |
| 12.971027 | 1 | |
| 12.037933 | 1 | |
| 11.422751 | 1 | |
| 11.173104 | 1 | |
| 10.739009 | 1 |
budget
Real number (ℝ)
| Distinct | 228 |
|---|---|
| Distinct (%) | 17.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 52003492 |
| Minimum | 1 |
| Maximum | 4.25 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2000000 |
| Q1 | 14000000 |
| median | 32000000 |
| Q3 | 70000000 |
| 95-th percentile | 1.7 × 10⁸ |
| Maximum | 4.25 × 10⁸ |
| Range | 4.25 × 10⁸ |
| Interquartile range (IQR) | 56000000 |
Descriptive statistics
| Standard deviation | 55145404 |
|---|---|
| Coefficient of variation (CV) | 1.0604173 |
| Kurtosis | 4.3678954 |
| Mean | 52003492 |
| Median Absolute Deviation (MAD) | 23500000 |
| Skewness | 1.8587748 |
| Sum | 6.6928495 × 10¹⁰ |
| Variance | 3.0410156 × 10¹⁵ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30000000 | 53 | 4.1% |
| 40000000 | 51 | 4.0% |
| 15000000 | 50 | 3.9% |
| 20000000 | 45 | 3.5% |
| 25000000 | 40 | 3.1% |
| 50000000 | 36 | 2.8% |
| 60000000 | 35 | 2.7% |
| 35000000 | 35 | 2.7% |
| 150000000 | 33 | 2.6% |
| 10000000 | 29 | 2.3% |
| Other values (218) | 880 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 3 | 1 | |
| 30 | 1 | |
| 68 | 1 | |
| 75 | 1 | |
| 93 | 1 | |
| 7000 | 1 | |
| 8000 | 1 | |
| 15000 | 1 | |
| 17000 | 1 |
| Value | Count | Frequency (%) |
| 425000000 | 1 | 0.1% |
| 380000000 | 1 | 0.1% |
| 300000000 | 1 | 0.1% |
| 280000000 | 1 | 0.1% |
| 260000000 | 2 | 0.2% |
| 258000000 | 1 | 0.1% |
| 255000000 | 1 | 0.1% |
| 250000000 | 7 | |
| 245000000 | 1 | 0.1% |
| 237000000 | 1 | 0.1% |
revenue
Real number (ℝ)
| Distinct | 1285 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7624444 × 10⁸ |
| Minimum | 43 |
| Maximum | 2.7815058 × 10⁹ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | 43 |
|---|---|
| 5-th percentile | 716278.1 |
| Q1 | 25650970 |
| median | 82087155 |
| Q3 | 2.1406942 × 10⁸ |
| 95-th percentile | 7.0979216 × 10⁸ |
| Maximum | 2.7815058 × 10⁹ |
| Range | 2.7815058 × 10⁹ |
| Interquartile range (IQR) | 1.8841845 × 10⁸ |
Descriptive statistics
| Standard deviation | 2.5381558 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 1.4401338 |
| Kurtosis | 16.155231 |
| Mean | 1.7624444 × 10⁸ |
| Median Absolute Deviation (MAD) | 70976180 |
| Skewness | 3.1750379 |
| Sum | 2.2682659 × 10¹¹ |
| Variance | 6.4422347 × 10¹⁶ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 70000000 | 2 | 0.2% |
| 2500000 | 2 | 0.2% |
| 1513528810 | 1 | 0.1% |
| 970761885 | 1 | 0.1% |
| 131940411 | 1 | 0.1% |
| 125537191 | 1 | 0.1% |
| 531865000 | 1 | 0.1% |
| 304654182 | 1 | 0.1% |
| 318000141 | 1 | 0.1% |
| 98337295 | 1 | 0.1% |
| Other values (1275) | 1275 |
| Value | Count | Frequency (%) |
| 43 | 1 | |
| 46 | 1 | |
| 134 | 1 | |
| 193 | 1 | |
| 200 | 1 | |
| 1378 | 1 | |
| 7306 | 1 | |
| 15071 | 1 | |
| 18097 | 1 | |
| 30471 | 1 |
| Value | Count | Frequency (%) |
| 2781505847 | 1 | |
| 2068178225 | 1 | |
| 1845034188 | 1 | |
| 1519557910 | 1 | |
| 1513528810 | 1 | |
| 1506249360 | 1 | |
| 1405035767 | 1 | |
| 1327817822 | 1 | |
| 1274219009 | 1 | |
| 1215439994 | 1 |
original_title
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 1280 |
|---|---|
| Distinct (%) | 99.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
| Halloween | 2 |
|---|---|
| The Three Musketeers | 2 |
| Wanted | 2 |
| The Thing | 2 |
| Halloween II | 2 |
| Other values (1275) |
Length
| Max length | 83 |
|---|---|
| Median length | 45 |
| Mean length | 15.009324 |
| Min length | 2 |
Characters and Unicode
| Total characters | 19317 |
|---|---|
| Distinct characters | 95 |
| Distinct categories | 15 |
| Distinct scripts | 2 |
| Distinct blocks | 4 |
Unique
| Unique | 1273 |
|---|---|
| Unique (%) | 98.9% |
Sample
| 1st row | Jurassic World |
|---|---|
| 2nd row | Mad Max: Fury Road |
| 3rd row | Insurgent |
| 4th row | Star Wars: The Force Awakens |
| 5th row | Furious 7 |
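The length statistics for `original_title` can be illustrated on the five sample rows, and the reported mean length can be cross-checked against the total character count. This is a minimal sketch using only the standard library; the variable names are illustrative, and the sample statistics describe these five titles only, not the full column.

```python
import statistics

# The five sample rows listed above.
titles = [
    "Jurassic World",
    "Mad Max: Fury Road",
    "Insurgent",
    "Star Wars: The Force Awakens",
    "Furious 7",
]

lengths = [len(title) for title in titles]
sample_mean = statistics.mean(lengths)
sample_median = statistics.median(lengths)

# Mean length over the whole column = total characters / observations.
reported_mean = 19317 / 1287

print(lengths, sample_mean, sample_median, round(reported_mean, 6))
```

Dividing the reported 19317 total characters by 1287 observations reproduces the reported mean length of 15.009324.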
Common Values
| Value | Count | Frequency (%) |
| Halloween | 2 | 0.2% |
| The Three Musketeers | 2 | 0.2% |
| Wanted | 2 | 0.2% |
| The Thing | 2 | 0.2% |
| Halloween II | 2 | 0.2% |
| Clash of the Titans | 2 | 0.2% |
| The Fog | 2 | 0.2% |
| Jurassic World | 1 | 0.1% |
| The Conjuring | 1 | 0.1% |
| Riddick | 1 | 0.1% |
| Other values (1270) | 1270 |
Length
| Value | Count | Frequency (%) |
| the | 419 | 12.0% |
| of | 121 | 3.5% |
| a | 43 | 1.2% |
| and | 42 | 1.2% |
| in | 35 | 1.0% |
| | 33 | 0.9% |
| 2 | 27 | 0.8% |
| to | 21 | 0.6% |
| man | 18 | 0.5% |
| love | 13 | 0.4% |
| Other values (1719) | 2726 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2211 | 11.4% |
| e | 1968 | 10.2% |
| a | 1192 | 6.2% |
| o | 1172 | 6.1% |
| n | 1071 | 5.5% |
| r | 1051 | 5.4% |
| t | 974 | 5.0% |
| i | 968 | 5.0% |
| s | 753 | 3.9% |
| h | 731 | 3.8% |
| Other values (85) | 7226 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13580 | 70.3% |
| Uppercase Letter | 3092 | 16.0% |
| Space Separator | 2212 | 11.5% |
| Other Punctuation | 250 | 1.3% |
| Decimal Number | 129 | 0.7% |
| Dash Punctuation | 33 | 0.2% |
| Modifier Symbol | 5 | < 0.1% |
| Other Symbol | 4 | < 0.1% |
| Other Number | 3 | < 0.1% |
| Final Punctuation | 2 | < 0.1% |
| Other values (5) | 7 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 448 | |
| S | 256 | 8.3% |
| M | 199 | 6.4% |
| A | 177 | 5.7% |
| B | 177 | 5.7% |
| D | 171 | 5.5% |
| P | 151 | 4.9% |
| H | 145 | 4.7% |
| C | 145 | 4.7% |
| I | 137 | 4.4% |
| Other values (21) | 1086 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1968 | |
| a | 1192 | 8.8% |
| o | 1172 | 8.6% |
| n | 1071 | 7.9% |
| r | 1051 | 7.7% |
| t | 974 | 7.2% |
| i | 968 | 7.1% |
| s | 753 | 5.5% |
| h | 731 | 5.4% |
| l | 612 | 4.5% |
| Other values (16) | 3088 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 105 | |
| ' | 48 | |
| . | 40 | 16.0% |
| & | 22 | 8.8% |
| , | 20 | 8.0% |
| ! | 9 | 3.6% |
| ? | 2 | 0.8% |
| / | 2 | 0.8% |
| ¡ | 1 | 0.4% |
| · | 1 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 37 | |
| 3 | 26 | |
| 1 | 21 | |
| 0 | 18 | |
| 7 | 7 | 5.4% |
| 5 | 6 | 4.7% |
| 4 | 5 | 3.9% |
| 8 | 5 | 3.9% |
| 9 | 2 | 1.6% |
| 6 | 2 | 1.6% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ¸ | 2 | |
| ´ | 2 | |
| ¨ | 1 |
Other Number
| Value | Count | Frequency (%) |
| ¼ | 1 | |
| ½ | 1 | |
| ³ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2211 | |
| (other space separator) | 1 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3 | |
| © | 1 | 25.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 | |
| ‚ | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 33 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 2 |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Other Letter
| Value | Count | Frequency (%) |
| ª | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16673 | 86.3% |
| Common | 2644 | 13.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1968 | 11.8% |
| a | 1192 | 7.1% |
| o | 1172 | 7.0% |
| n | 1071 | 6.4% |
| r | 1051 | 6.3% |
| t | 974 | 5.8% |
| i | 968 | 5.8% |
| s | 753 | 4.5% |
| h | 731 | 4.4% |
| l | 612 | 3.7% |
| Other values (48) | 6181 |
Common
| Value | Count | Frequency (%) |
| (space) | 2211 | 83.6% |
| : | 105 | 4.0% |
| ' | 48 | 1.8% |
| . | 40 | 1.5% |
| 2 | 37 | 1.4% |
| - | 33 | 1.2% |
| 3 | 26 | 1.0% |
| & | 22 | 0.8% |
| 1 | 21 | 0.8% |
| , | 20 | 0.8% |
| Other values (27) | 81 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19272 | 99.8% |
| None | 42 | 0.2% |
| Currency Symbols | 2 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2211 | 11.5% |
| e | 1968 | 10.2% |
| a | 1192 | 6.2% |
| o | 1172 | 6.1% |
| n | 1071 | 5.6% |
| r | 1051 | 5.5% |
| t | 974 | 5.1% |
| i | 968 | 5.0% |
| s | 753 | 3.9% |
| h | 731 | 3.8% |
| Other values (65) | 7181 |
None
| Value | Count | Frequency (%) |
| Ð | 14 | |
| Ã | 4 | 9.5% |
| Ñ | 4 | 9.5% |
| ° | 3 | 7.1% |
| ¸ | 2 | 4.8% |
| » | 2 | 4.8% |
| ´ | 2 | 4.8% |
| ¼ | 1 | 2.4% |
| Š | 1 | 2.4% |
| Â | 1 | 2.4% |
| Other values (8) | 8 |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 2 |
Punctuation
| Value | Count | Frequency (%) |
| ‚ | 1 |
cast
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 1278 |
|---|---|
| Distinct (%) | 99.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
| Elijah Wood\|Ian McKellen\|Viggo Mortensen\|Liv Tyler\|Orlando Bloom | 3 |
|---|---|
| Jennifer Lawrence\|Josh Hutcherson\|Liam Hemsworth\|Woody Harrelson\|Elizabeth Banks | 3 |
| Mike Myers\|Eddie Murphy\|Cameron Diaz\|Julie Andrews\|Antonio Banderas | 2 |
| Mark Hamill\|Harrison Ford\|Carrie Fisher\|Billy Dee Williams\|Anthony Daniels | 2 |
| Martin Freeman\|Ian McKellen\|Richard Armitage\|Ken Stott\|Graham McTavish | 2 |
| Other values (1273) |
Length
| Max length | 98 |
|---|---|
| Median length | 85 |
| Mean length | 69.337218 |
| Min length | 9 |
Characters and Unicode
| Total characters | 89237 |
|---|---|
| Distinct characters | 95 |
| Distinct categories | 15 |
| Distinct scripts | 2 |
| Distinct blocks | 4 |
Unique
| Unique | 1271 |
|---|---|
| Unique (%) | 98.8% |
Sample
| 1st row | Chris Pratt\|Bryce Dallas Howard\|Irrfan Khan\|Vincent D'Onofrio\|Nick Robinson |
|---|---|
| 2nd row | Tom Hardy\|Charlize Theron\|Hugh Keays-Byrne\|Nicholas Hoult\|Josh Helman |
| 3rd row | Shailene Woodley\|Theo James\|Kate Winslet\|Ansel Elgort\|Miles Teller |
| 4th row | Harrison Ford\|Mark Hamill\|Carrie Fisher\|Adam Driver\|Daisy Ridley |
| 5th row | Vin Diesel\|Paul Walker\|Jason Statham\|Michelle Rodriguez\|Dwayne Johnson |
Common Values
| Value | Count | Frequency (%) |
| Elijah Wood\|Ian McKellen\|Viggo Mortensen\|Liv Tyler\|Orlando Bloom | 3 | 0.2% |
| Jennifer Lawrence\|Josh Hutcherson\|Liam Hemsworth\|Woody Harrelson\|Elizabeth Banks | 3 | 0.2% |
| Mike Myers\|Eddie Murphy\|Cameron Diaz\|Julie Andrews\|Antonio Banderas | 2 | 0.2% |
| Mark Hamill\|Harrison Ford\|Carrie Fisher\|Billy Dee Williams\|Anthony Daniels | 2 | 0.2% |
| Martin Freeman\|Ian McKellen\|Richard Armitage\|Ken Stott\|Graham McTavish | 2 | 0.2% |
| Ewan McGregor\|Natalie Portman\|Hayden Christensen\|Ian McDiarmid\|Samuel L. Jackson | 2 | 0.2% |
| Kristen Stewart\|Robert Pattinson\|Taylor Lautner\|Ashley Greene\|Peter Facinelli | 2 | 0.2% |
| Chris Pratt\|Bryce Dallas Howard\|Irrfan Khan\|Vincent D'Onofrio\|Nick Robinson | 1 | 0.1% |
| Vin Diesel\|Karl Urban\|Katee Sackhoff\|Jordi Mollà \|Bokeem Woodbine | 1 | 0.1% |
| Denzel Washington\|Mark Wahlberg\|Paula Patton\|Bill Paxton\|Fred Ward | 1 | 0.1% |
| Other values (1268) | 1268 |
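Each `cast` value packs several names into one string separated by `|`, which helps explain why the column is almost unique and why `|` is counted under Math Symbol in the character categories. The sketch below splits three of the sample rows into individual names; it assumes `|` never occurs inside a name, and the variable names are illustrative.

```python
import unicodedata
from collections import Counter

# Three of the sample rows listed above.
cast_rows = [
    "Chris Pratt|Bryce Dallas Howard|Irrfan Khan|Vincent D'Onofrio|Nick Robinson",
    "Tom Hardy|Charlize Theron|Hugh Keays-Byrne|Nicholas Hoult|Josh Helman",
    "Harrison Ford|Mark Hamill|Carrie Fisher|Adam Driver|Daisy Ridley",
]

# Split each row on the separator to get one list of names per film.
cast_lists = [row.split("|") for row in cast_rows]

# Count how often each individual name appears across rows.
actor_counts = Counter(name for names in cast_lists for name in names)

# The separator's Unicode general category ("Sm" stands for Math Symbol).
pipe_category = unicodedata.category("|")

print(cast_lists[0], pipe_category)
```

Counting individual names rather than whole strings gives a far lower cardinality, which is usually the more useful view of a multi-valued field like this one.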
Length
| Value | Count | Frequency (%) |
| tom | 33 | 0.4% |
| michael | 22 | 0.3% |
| ben | 22 | 0.3% |
| james | 21 | 0.3% |
| jason | 20 | 0.2% |
| de | 20 | 0.2% |
| mark | 20 | 0.2% |
| patrick | 18 | 0.2% |
| l | 18 | 0.2% |
| chris | 17 | 0.2% |
| Other values (6385) | 7893 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 7966 | 8.9% |
| a | 7445 | 8.3% |
| (space) | 6817 | 7.6% |
| n | 6237 | 7.0% |
| \| | 5131 | 5.7% |
| i | 5114 | 5.7% |
| r | 5113 | 5.7% |
| o | 4662 | 5.2% |
| l | 4390 | 4.9% |
| s | 3180 | 3.6% |
| Other values (85) | 33182 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 63187 | 70.8% |
| Uppercase Letter | 13663 | 15.3% |
| Space Separator | 6820 | 7.6% |
| Math Symbol | 5137 | 5.8% |
| Other Punctuation | 243 | 0.3% |
| Dash Punctuation | 84 | 0.1% |
| Other Symbol | 45 | 0.1% |
| Format | 14 | < 0.1% |
| Modifier Symbol | 13 | < 0.1% |
| Initial Punctuation | 11 | < 0.1% |
| Other values (5) | 20 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 1238 | 9.1% |
| M | 1117 | 8.2% |
| C | 1069 | 7.8% |
| S | 1046 | 7.7% |
| B | 1031 | 7.5% |
| R | 807 | 5.9% |
| D | 787 | 5.8% |
| A | 757 | 5.5% |
| H | 621 | 4.5% |
| K | 606 | 4.4% |
| Other values (22) | 4584 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7966 | |
| a | 7445 | |
| n | 6237 | |
| i | 5114 | 8.1% |
| r | 5113 | 8.1% |
| o | 4662 | 7.4% |
| l | 4390 | 6.9% |
| s | 3180 | 5.0% |
| t | 3001 | 4.7% |
| h | 2436 | 3.9% |
| Other values (19) | 13643 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 153 | |
| ' | 54 | 22.2% |
| ¡ | 17 | 7.0% |
| ‡ | 7 | 2.9% |
| , | 5 | 2.1% |
| … | 3 | 1.2% |
| ¶ | 2 | 0.8% |
| • | 1 | 0.4% |
| § | 1 | 0.4% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ¸ | 10 | |
| ¯ | 1 | 7.7% |
| ¨ | 1 | 7.7% |
| ´ | 1 | 7.7% |
Other Number
| Value | Count | Frequency (%) |
| ¼ | 3 | |
| ³ | 1 | 16.7% |
| ² | 1 | 16.7% |
| ¹ | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6817 | |
| (other space separator) | 3 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 5131 | 99.9% |
| ± | 6 | 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| © | 44 | |
| ™ | 1 | 2.2% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 9 | |
| “ | 2 | 18.2% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¥ | 7 | |
| £ | 2 | 22.2% |
Other Letter
| Value | Count | Frequency (%) |
| ª | 1 | |
| º | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 5 | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 84 |
Format
| Value | Count | Frequency (%) |
| | 14 |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 76851 | 86.1% |
| Common | 12386 | 13.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 7966 | 10.4% |
| a | 7445 | 9.7% |
| n | 6237 | 8.1% |
| i | 5114 | 6.7% |
| r | 5113 | 6.7% |
| o | 4662 | 6.1% |
| l | 4390 | 5.7% |
| s | 3180 | 4.1% |
| t | 3001 | 3.9% |
| h | 2436 | 3.2% |
| Other values (52) | 27307 |
Common
| Value | Count | Frequency (%) |
| (space) | 6817 | 55.0% |
| \| | 5131 | 41.4% |
| . | 153 | 1.2% |
| - | 84 | 0.7% |
| ' | 54 | 0.4% |
| © | 44 | 0.4% |
| ¡ | 17 | 0.1% |
| | 14 | 0.1% |
| ¸ | 10 | 0.1% |
| « | 9 | 0.1% |
| Other values (23) | 53 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 88954 | 99.7% |
| None | 268 | 0.3% |
| Punctuation | 14 | < 0.1% |
| Letterlike Symbols | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 7966 | 9.0% |
| a | 7445 | 8.4% |
| (space) | 6817 | 7.7% |
| n | 6237 | 7.0% |
| \| | 5131 | 5.8% |
| i | 5114 | 5.7% |
| r | 5113 | 5.7% |
| o | 4662 | 5.2% |
| l | 4390 | 4.9% |
| s | 3180 | 3.6% |
| Other values (50) | 32899 |
None
| Value | Count | Frequency (%) |
| Ã | 114 | |
| © | 44 | 16.4% |
| ¡ | 17 | 6.3% |
| | 14 | 5.2% |
| à | 11 | 4.1% |
| ¸ | 10 | 3.7% |
| « | 9 | 3.4% |
| Ä | 8 | 3.0% |
| ¥ | 7 | 2.6% |
| ± | 6 | 2.2% |
| Other values (19) | 28 | 10.4% |
Punctuation
| Value | Count | Frequency (%) |
| ‡ | 7 | |
| … | 3 | |
| “ | 2 | 14.3% |
| • | 1 | 7.1% |
| › | 1 | 7.1% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 1 |
homepage
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 1266 |
|---|---|
| Distinct (%) | 98.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
| http://www.thehungergames.movie/ | 4 |
|---|---|
| http://www.missionimpossible.com/ | 4 |
| http://www.thehobbit.com/ | 3 |
| http://www.transformersmovie.com/ | 3 |
| http://disney.go.com/disneypictures/pirates/ | 2 |
| Other values (1261) |
Length
| Max length | 138 |
|---|---|
| Median length | 77 |
| Mean length | 36.724165 |
| Min length | 18 |
Characters and Unicode
| Total characters | 47264 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 1251 |
|---|---|
| Unique (%) | 97.2% |
Sample
| 1st row | http://www.jurassicworld.com/ |
|---|---|
| 2nd row | http://www.madmaxmovie.com/ |
| 3rd row | http://www.thedivergentseries.movie/#insurgent |
| 4th row | http://www.starwars.com/films/star-wars-episode-vii |
| 5th row | http://www.furious7.com/ |
Common Values
| Value | Count | Frequency (%) |
| http://www.thehungergames.movie/ | 4 | 0.3% |
| http://www.missionimpossible.com/ | 4 | 0.3% |
| http://www.thehobbit.com/ | 3 | 0.2% |
| http://www.transformersmovie.com/ | 3 | 0.2% |
| http://disney.go.com/disneypictures/pirates/ | 2 | 0.2% |
| http://www.theamazingspiderman.com | 2 | 0.2% |
| http://www.indianajones.com | 2 | 0.2% |
| http://phantasm.com | 2 | 0.2% |
| http://www.howtotrainyourdragon.com/ | 2 | 0.2% |
| http://www.lordoftherings.net/ | 2 | 0.2% |
| Other values (1256) | 1261 |
Length
| Value | Count | Frequency (%) |
| http://www.missionimpossible.com | 5 | 0.4% |
| http://www.thehungergames.movie | 4 | 0.3% |
| http://www.transformersmovie.com | 4 | 0.3% |
| http://www.thehobbit.com | 3 | 0.2% |
| http://www.lordoftherings.net | 3 | 0.2% |
| http://www.ironmanmovie.com | 2 | 0.2% |
| http://www.twilightthemovie.com | 2 | 0.2% |
| http://www.harrypotter.com | 2 | 0.2% |
| http://www.kungfupanda.com | 2 | 0.2% |
| http://stepupmovie.com | 2 | 0.2% |
| Other values (1252) | 1258 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 4352 | 9.2% |
| t | 4306 | 9.1% |
| o | 3413 | 7.2% |
| e | 3394 | 7.2% |
| w | 3329 | 7.0% |
| m | 2707 | 5.7% |
| . | 2558 | 5.4% |
| h | 2323 | 4.9% |
| i | 2309 | 4.9% |
| c | 1862 | 3.9% |
| Other values (63) | 16711 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37824 | 80.0% |
| Other Punctuation | 8246 | 17.4% |
| Dash Punctuation | 473 | 1.0% |
| Decimal Number | 434 | 0.9% |
| Uppercase Letter | 197 | 0.4% |
| Connector Punctuation | 67 | 0.1% |
| Math Symbol | 19 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4306 | |
| o | 3413 | 9.0% |
| e | 3394 | 9.0% |
| w | 3329 | 8.8% |
| m | 2707 | 7.2% |
| h | 2323 | 6.1% |
| i | 2309 | 6.1% |
| c | 1862 | 4.9% |
| p | 1791 | 4.7% |
| r | 1739 | 4.6% |
| Other values (16) | 10651 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 17 | 8.6% |
| T | 17 | 8.6% |
| M | 16 | 8.1% |
| S | 16 | 8.1% |
| A | 15 | 7.6% |
| D | 14 | 7.1% |
| N | 12 | 6.1% |
| L | 10 | 5.1% |
| O | 9 | 4.6% |
| G | 8 | 4.1% |
| Other values (13) | 63 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 89 | |
| 1 | 73 | |
| 0 | 63 | |
| 3 | 62 | |
| 9 | 30 | 6.9% |
| 4 | 30 | 6.9% |
| 7 | 28 | 6.5% |
| 8 | 22 | 5.1% |
| 5 | 22 | 5.1% |
| 6 | 15 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 4352 | |
| . | 2558 | |
| : | 1287 | 15.6% |
| # | 18 | 0.2% |
| ? | 13 | 0.2% |
| % | 9 | 0.1% |
| & | 6 | 0.1% |
| , | 2 | < 0.1% |
| ! | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 473 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 67 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 19 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38021 | 80.4% |
| Common | 9243 | 19.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4306 | 11.3% |
| o | 3413 | 9.0% |
| e | 3394 | 8.9% |
| w | 3329 | 8.8% |
| m | 2707 | 7.1% |
| h | 2323 | 6.1% |
| i | 2309 | 6.1% |
| c | 1862 | 4.9% |
| p | 1791 | 4.7% |
| r | 1739 | 4.6% |
| Other values (39) | 10848 |
Common
| Value | Count | Frequency (%) |
| / | 4352 | |
| . | 2558 | |
| : | 1287 | 13.9% |
| - | 473 | 5.1% |
| 2 | 89 | 1.0% |
| 1 | 73 | 0.8% |
| _ | 67 | 0.7% |
| 0 | 63 | 0.7% |
| 3 | 62 | 0.7% |
| 9 | 30 | 0.3% |
| Other values (14) | 189 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47264 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 4352 | 9.2% |
| t | 4306 | 9.1% |
| o | 3413 | 7.2% |
| e | 3394 | 7.2% |
| w | 3329 | 7.0% |
| m | 2707 | 5.7% |
| . | 2558 | 5.4% |
| h | 2323 | 4.9% |
| i | 2309 | 4.9% |
| c | 1862 | 3.9% |
| Other values (63) | 16711 |
director
Categorical
| Distinct | 789 |
|---|---|
| Distinct (%) | 61.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
| John Carpenter | 12 |
|---|---|
| Steven Spielberg | 11 |
| Steven Soderbergh | 10 |
| Robert Zemeckis | 8 |
| Clint Eastwood | 8 |
| Other values (784) |
Length
| Max length | 79 |
|---|---|
| Median length | 40 |
| Mean length | 14.3885 |
| Min length | 3 |
Characters and Unicode
| Total characters | 18518 |
|---|---|
| Distinct characters | 69 |
| Distinct categories | 11 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 535 |
|---|---|
| Unique (%) | 41.6% |
Sample
| 1st row | Colin Trevorrow |
|---|---|
| 2nd row | George Miller |
| 3rd row | Robert Schwentke |
| 4th row | J.J. Abrams |
| 5th row | James Wan |
Common Values
| Value | Count | Frequency (%) |
| John Carpenter | 12 | 0.9% |
| Steven Spielberg | 11 | 0.9% |
| Steven Soderbergh | 10 | 0.8% |
| Robert Zemeckis | 8 | 0.6% |
| Clint Eastwood | 8 | 0.6% |
| Ridley Scott | 8 | 0.6% |
| Peter Jackson | 8 | 0.6% |
| Ron Howard | 7 | 0.5% |
| Christopher Nolan | 7 | 0.5% |
| Paul W.S. Anderson | 7 | 0.5% |
| Other values (779) | 1201 | 93.3% |
Words
| Value | Count | Frequency (%) |
| john | 57 | 2.0% |
| david | 50 | 1.8% |
| peter | 32 | 1.1% |
| steven | 28 | 1.0% |
| paul | 26 | 0.9% |
| michael | 24 | 0.9% |
| robert | 22 | 0.8% |
| james | 20 | 0.7% |
| rob | 20 | 0.7% |
| martin | 17 | 0.6% |
| Other values (1179) | 2488 | 89.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1760 | 9.5% |
| (space) | 1498 | 8.1% |
| a | 1395 | 7.5% |
| n | 1335 | 7.2% |
| r | 1300 | 7.0% |
| o | 1083 | 5.8% |
| i | 1052 | 5.7% |
| l | 835 | 4.5% |
| t | 682 | 3.7% |
| s | 652 | 3.5% |
| Other values (59) | 6926 | 37.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13777 | 74.4% |
| Uppercase Letter | 3000 | 16.2% |
| Space Separator | 1498 | 8.1% |
| Math Symbol | 126 | 0.7% |
| Other Punctuation | 88 | 0.5% |
| Dash Punctuation | 10 | 0.1% |
| Currency Symbol | 8 | < 0.1% |
| Other Number | 4 | < 0.1% |
| Other Symbol | 3 | < 0.1% |
| Final Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 319 | 10.6% |
| J | 264 | 8.8% |
| M | 249 | 8.3% |
| R | 204 | 6.8% |
| C | 203 | 6.8% |
| B | 183 | 6.1% |
| D | 170 | 5.7% |
| G | 167 | 5.6% |
| A | 161 | 5.4% |
| L | 140 | 4.7% |
| Other values (18) | 940 | 31.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1760 | 12.8% |
| a | 1395 | 10.1% |
| n | 1335 | 9.7% |
| r | 1300 | 9.4% |
| o | 1083 | 7.9% |
| i | 1052 | 7.6% |
| l | 835 | 6.1% |
| t | 682 | 5.0% |
| s | 652 | 4.7% |
| h | 519 | 3.8% |
| Other values (16) | 3164 | 23.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 62 | 70.5% |
| ¡ | 12 | 13.6% |
| ¶ | 8 | 9.1% |
| ' | 4 | 4.5% |
| ‰ | 1 | 1.1% |
| § | 1 | 1.1% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 124 | 98.4% |
| ± | 2 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1498 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10 | 100.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¥ | 8 | 100.0% |
Other Number
| Value | Count | Frequency (%) |
| ³ | 4 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
| © | 3 | 100.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 2 | 100.0% |
Format
| Value | Count | Frequency (%) |
| (invisible format character) | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16777 | 90.6% |
| Common | 1741 | 9.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1760 | 10.5% |
| a | 1395 | 8.3% |
| n | 1335 | 8.0% |
| r | 1300 | 7.7% |
| o | 1083 | 6.5% |
| i | 1052 | 6.3% |
| l | 835 | 5.0% |
| t | 682 | 4.1% |
| s | 652 | 3.9% |
| h | 519 | 3.1% |
| Other values (44) | 6164 | 36.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 1498 | 86.0% |
| | | 124 | 7.1% |
| . | 62 | 3.6% |
| ¡ | 12 | 0.7% |
| - | 10 | 0.6% |
| ¶ | 8 | 0.5% |
| ¥ | 8 | 0.5% |
| ³ | 4 | 0.2% |
| ' | 4 | 0.2% |
| © | 3 | 0.2% |
| Other values (5) | 8 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18432 | 99.5% |
| None | 85 | 0.5% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1760 | 9.5% |
| (space) | 1498 | 8.1% |
| a | 1395 | 7.6% |
| n | 1335 | 7.2% |
| r | 1300 | 7.1% |
| o | 1083 | 5.9% |
| i | 1052 | 5.7% |
| l | 835 | 4.5% |
| t | 682 | 3.7% |
| s | 652 | 3.5% |
| Other values (47) | 6840 | 37.1% |
None
| Value | Count | Frequency (%) |
| Ã | 42 | 49.4% |
| ¡ | 12 | 14.1% |
| ¶ | 8 | 9.4% |
| ¥ | 8 | 9.4% |
| ³ | 4 | 4.7% |
| © | 3 | 3.5% |
| » | 2 | 2.4% |
| ± | 2 | 2.4% |
| (invisible format character) | 2 | 2.4% |
| § | 1 | 1.2% |
Punctuation
| Value | Count | Frequency (%) |
| ‰ | 1 | 100.0% |
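The characters in the None block above (Ã, ¡, ¶, ©, and similar) look like UTF-8 text that was decoded with a single-byte code page. The following is a minimal sketch of a repair, assuming that is what happened; the helper name `repair_mojibake` and the sample strings are hypothetical illustrations, not values taken from the dataset.

```python
def repair_mojibake(text, wrong_encoding="latin-1"):
    """Try to undo UTF-8 bytes that were decoded as a single-byte encoding.

    Returns the input unchanged when the round trip fails, so strings
    that are already clean are left alone.
    """
    try:
        return text.encode(wrong_encoding).decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        return text


# Hypothetical example: "é" stored as UTF-8 but read as Latin-1 shows up as "Ã©".
garbled = "JosÃ©"
repaired = repair_mojibake(garbled)
print(repaired)
```

Characters such as ‰ or € do not exist in Latin-1, so for those rows `wrong_encoding="cp1252"` may be the better guess; that choice is also an assumption to verify against the raw file.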
tagline
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 1283 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
Length
| Max length | 286 |
|---|---|
| Median length | 96 |
| Mean length | 37.982906 |
| Min length | 3 |
Characters and Unicode
| Total characters | 48884 |
|---|---|
| Distinct characters | 91 |
| Distinct categories | 11 |
| Distinct scripts | 2 |
| Distinct blocks | 6 |
Unique
| Unique | 1279 |
|---|---|
| Unique (%) | 99.4% |
Sample
| 1st row | The park is open. |
|---|---|
| 2nd row | What a Lovely Day. |
| 3rd row | One Choice Can Destroy You |
| 4th row | Every generation has a story. |
| 5th row | Vengeance Hits Home |
Common Values
| Value | Count | Frequency (%) |
| The only way out is down. | 2 | 0.2% |
| Love is a force of nature. | 2 | 0.2% |
| Evil will rise. | 2 | 0.2% |
| One ordinary couple. One little white lie. | 2 | 0.2% |
| Where There Are Gods, There Are Monsters. | 1 | 0.1% |
| 2 Guns, 1 Bank. | 1 | 0.1% |
| This is not a game. | 1 | 0.1% |
| Remember Philly! | 1 | 0.1% |
| Yippee Ki-Yay Mother Russia | 1 | 0.1% |
| Based on the true case files of the Warrens | 1 | 0.1% |
| Other values (1273) | 1273 | 98.9% |
Words
| Value | Count | Frequency (%) |
| the | 553 | 6.1% |
| a | 309 | 3.4% |
| is | 217 | 2.4% |
| to | 186 | 2.1% |
| you | 170 | 1.9% |
| of | 168 | 1.9% |
| in | 122 | 1.3% |
| one | 109 | 1.2% |
| it | 90 | 1.0% |
| and | 75 | 0.8% |
| Other values (2027) | 7059 | 77.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 7778 | 15.9% |
| e | 5218 | 10.7% |
| o | 3034 | 6.2% |
| t | 3008 | 6.2% |
| a | 2544 | 5.2% |
| n | 2473 | 5.1% |
| i | 2355 | 4.8% |
| r | 2339 | 4.8% |
| s | 2298 | 4.7% |
| h | 1847 | 3.8% |
| Other values (81) | 15990 | 32.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35588 | 72.8% |
| Space Separator | 7778 | 15.9% |
| Uppercase Letter | 3036 | 6.2% |
| Other Punctuation | 2242 | 4.6% |
| Decimal Number | 172 | 0.4% |
| Dash Punctuation | 38 | 0.1% |
| Currency Symbol | 13 | < 0.1% |
| Other Symbol | 9 | < 0.1% |
| Open Punctuation | 4 | < 0.1% |
| Close Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5218 | 14.7% |
| o | 3034 | 8.5% |
| t | 3008 | 8.5% |
| a | 2544 | 7.1% |
| n | 2473 | 6.9% |
| i | 2355 | 6.6% |
| r | 2339 | 6.6% |
| s | 2298 | 6.5% |
| h | 1847 | 5.2% |
| l | 1462 | 4.1% |
| Other values (24) | 9010 | 25.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 427 | 14.1% |
| A | 262 | 8.6% |
| S | 205 | 6.8% |
| W | 201 | 6.6% |
| I | 198 | 6.5% |
| H | 188 | 6.2% |
| B | 177 | 5.8% |
| O | 141 | 4.6% |
| N | 134 | 4.4% |
| L | 129 | 4.2% |
| Other values (15) | 974 | 32.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1554 | 69.3% |
| ' | 311 | 13.9% |
| , | 213 | 9.5% |
| ? | 80 | 3.6% |
| ! | 71 | 3.2% |
| : | 5 | 0.2% |
| % | 3 | 0.1% |
| * | 1 | < 0.1% |
| # | 1 | < 0.1% |
| & | 1 | < 0.1% |
| Other values (2) | 2 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 62 | 36.0% |
| 1 | 29 | 16.9% |
| 2 | 18 | 10.5% |
| 7 | 16 | 9.3% |
| 9 | 13 | 7.6% |
| 5 | 10 | 5.8% |
| 3 | 8 | 4.7% |
| 8 | 8 | 4.7% |
| 4 | 4 | 2.3% |
| 6 | 4 | 2.3% |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 11 | 84.6% |
| $ | 2 | 15.4% |
Other Symbol
| Value | Count | Frequency (%) |
| ¦ | 5 | 55.6% |
| ™ | 4 | 44.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 | 75.0% |
| „ | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7778 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 38 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 | 100.0% |
Modifier Letter
| Value | Count | Frequency (%) |
| ˆ | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38624 | 79.0% |
| Common | 10260 | 21.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5218 | 13.5% |
| o | 3034 | 7.9% |
| t | 3008 | 7.8% |
| a | 2544 | 6.6% |
| n | 2473 | 6.4% |
| i | 2355 | 6.1% |
| r | 2339 | 6.1% |
| s | 2298 | 5.9% |
| h | 1847 | 4.8% |
| l | 1462 | 3.8% |
| Other values (49) | 12046 | 31.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 7778 | 75.8% |
| . | 1554 | 15.1% |
| ' | 311 | 3.0% |
| , | 213 | 2.1% |
| ? | 80 | 0.8% |
| ! | 71 | 0.7% |
| 0 | 62 | 0.6% |
| - | 38 | 0.4% |
| 1 | 29 | 0.3% |
| 2 | 18 | 0.2% |
| Other values (22) | 106 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48843 | 99.9% |
| None | 23 | < 0.1% |
| Currency Symbols | 11 | < 0.1% |
| Letterlike Symbols | 4 | < 0.1% |
| Punctuation | 2 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 7778 | 15.9% |
| e | 5218 | 10.7% |
| o | 3034 | 6.2% |
| t | 3008 | 6.2% |
| a | 2544 | 5.2% |
| n | 2473 | 5.1% |
| i | 2355 | 4.8% |
| r | 2339 | 4.8% |
| s | 2298 | 4.7% |
| h | 1847 | 3.8% |
| Other values (66) | 15949 | 32.7% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 11 | 100.0% |
None
| Value | Count | Frequency (%) |
| â | 9 | 39.1% |
| ¦ | 5 | 21.7% |
| è | 2 | 8.7% |
| Ž | 1 | 4.3% |
| ž | 1 | 4.3% |
| š | 1 | 4.3% |
| ç | 1 | 4.3% |
| å | 1 | 4.3% |
| œ | 1 | 4.3% |
| æ | 1 | 4.3% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 4 | 100.0% |
Modifier Letters
| Value | Count | Frequency (%) |
| ˆ | 1 | 100.0% |
Punctuation
| Value | Count | Frequency (%) |
| „ | 1 | 50.0% |
| … | 1 | 50.0% |
keywords
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 1272 |
|---|---|
| Distinct (%) | 98.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
Length
| Max length | 131 |
|---|---|
| Median length | 81 |
| Mean length | 48.500389 |
| Min length | 3 |
Characters and Unicode
| Total characters | 62420 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 12 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 1264 |
|---|---|
| Unique (%) | 98.2% |
Sample
| 1st row | monster|dna|tyrannosaurus rex|velociraptor|island |
|---|---|
| 2nd row | future|chase|post-apocalyptic|dystopia|australia |
| 3rd row | based on novel|revolution|dystopia|sequel|dystopic future |
| 4th row | android|spaceship|jedi|space opera|3d |
| 5th row | car race|speed|revenge|suspense|car |
Common Values
| Value | Count | Frequency (%) |
| duringcreditsstinger | 6 | 0.5% |
| woman director | 4 | 0.3% |
| aftercreditsstinger | 3 | 0.2% |
| aftercreditsstinger|duringcreditsstinger | 2 | 0.2% |
| independent film | 2 | 0.2% |
| sequel | 2 | 0.2% |
| elves|dwarves|orcs|middle-earth (tolkien)|hobbits | 2 | 0.2% |
| independent film|woman director | 2 | 0.2% |
| undercover|undercover agent|based on comic book|number in title|money | 1 | 0.1% |
| angel|vampire|werewolf|warlock|downworlder | 1 | 0.1% |
| Other values (1262) | 1262 | 98.1% |
Words
| Value | Count | Frequency (%) |
| on | 124 | 3.0% |
| of | 107 | 2.6% |
| and | 54 | 1.3% |
| based | 48 | 1.2% |
| the | 43 | 1.1% |
| in | 39 | 1.0% |
| sister | 25 | 0.6% |
| new | 24 | 0.6% |
| brother | 23 | 0.6% |
| female | 21 | 0.5% |
| Other values (2912) | 3574 | 87.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5903 | 9.5% |
| i | 4762 | 7.6% |
| a | 4702 | 7.5% |
| | | 4605 | 7.4% |
| r | 4484 | 7.2% |
| n | 3980 | 6.4% |
| o | 3968 | 6.4% |
| t | 3750 | 6.0% |
| s | 3724 | 6.0% |
| (space) | 2794 | 4.5% |
| Other values (40) | 19748 | 31.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 54756 | 87.7% |
| Math Symbol | 4605 | 7.4% |
| Space Separator | 2797 | 4.5% |
| Dash Punctuation | 88 | 0.1% |
| Decimal Number | 76 | 0.1% |
| Other Punctuation | 72 | 0.1% |
| Uppercase Letter | 9 | < 0.1% |
| Close Punctuation | 6 | < 0.1% |
| Open Punctuation | 6 | < 0.1% |
| Other Symbol | 3 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5903 | 10.8% |
| i | 4762 | 8.7% |
| a | 4702 | 8.6% |
| r | 4484 | 8.2% |
| n | 3980 | 7.3% |
| o | 3968 | 7.2% |
| t | 3750 | 6.8% |
| s | 3724 | 6.8% |
| l | 2582 | 4.7% |
| c | 2241 | 4.1% |
| Other values (16) | 14660 | 26.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 20 | 26.3% |
| 1 | 14 | 18.4% |
| 9 | 13 | 17.1% |
| 0 | 12 | 15.8% |
| 7 | 10 | 13.2% |
| 2 | 3 | 3.9% |
| 5 | 2 | 2.6% |
| 4 | 1 | 1.3% |
| 6 | 1 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 36 | 50.0% |
| ' | 35 | 48.6% |
| · | 1 | 1.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Ã | 4 | 44.4% |
| Â | 3 | 33.3% |
| Î | 2 | 22.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2794 | 99.9% |
| (non-ASCII space) | 3 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 4605 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 88 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
| © | 3 | 100.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 | 100.0% |
Other Number
| Value | Count | Frequency (%) |
| ³ | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 54765 | 87.7% |
| Common | 7655 | 12.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5903 | 10.8% |
| i | 4762 | 8.7% |
| a | 4702 | 8.6% |
| r | 4484 | 8.2% |
| n | 3980 | 7.3% |
| o | 3968 | 7.2% |
| t | 3750 | 6.8% |
| s | 3724 | 6.8% |
| l | 2582 | 4.7% |
| c | 2241 | 4.1% |
| Other values (19) | 14669 | 26.8% |
Common
| Value | Count | Frequency (%) |
| | | 4605 | 60.2% |
| (space) | 2794 | 36.5% |
| - | 88 | 1.1% |
| . | 36 | 0.5% |
| ' | 35 | 0.5% |
| 3 | 20 | 0.3% |
| 1 | 14 | 0.2% |
| 9 | 13 | 0.2% |
| 0 | 12 | 0.2% |
| 7 | 10 | 0.1% |
| Other values (11) | 28 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 62402 | > 99.9% |
| None | 17 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5903 | 9.5% |
| i | 4762 | 7.6% |
| a | 4702 | 7.5% |
| | | 4605 | 7.4% |
| r | 4484 | 7.2% |
| n | 3980 | 6.4% |
| o | 3968 | 6.4% |
| t | 3750 | 6.0% |
| s | 3724 | 6.0% |
| (space) | 2794 | 4.5% |
| Other values (32) | 19730 | 31.6% |
None
| Value | Count | Frequency (%) |
| Ã | 4 | 23.5% |
| © | 3 | 17.6% |
| Â | 3 | 17.6% |
| (non-ASCII space) | 3 | 17.6% |
| Î | 2 | 11.8% |
| · | 1 | 5.9% |
| ³ | 1 | 5.9% |
Punctuation
| Value | Count | Frequency (%) |
| “ | 1 | 100.0% |
overview
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 1287 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
Length
| Max length | 1000 |
|---|---|
| Median length | 481 |
| Mean length | 311.14141 |
| Min length | 58 |
Characters and Unicode
| Total characters | 400439 |
|---|---|
| Distinct characters | 99 |
| Distinct categories | 14 |
| Distinct scripts | 2 |
| Distinct blocks | 6 |
Unique
| Unique | 1287 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Twenty-two years after the events of Jurassic Park, Isla Nublar now features a fully functioning dinosaur theme park, Jurassic World, as originally envisioned by John Hammond. |
|---|---|
| 2nd row | An apocalyptic story set in the furthest reaches of our planet, in a stark desert landscape where humanity is broken, and most everyone is crazed fighting for the necessities of life. Within this world exist two rebels on the run who just might be able to restore order. There's Max, a man of action and a man of few words, who seeks peace of mind following the loss of his wife and child in the aftermath of the chaos. And Furiosa, a woman of action and a woman who believes her path to survival may be achieved if she can make it across the desert back to her childhood homeland. |
| 3rd row | Beatrice Prior must confront her inner demons and continue her fight against a powerful alliance which threatens to tear her society apart. |
| 4th row | Thirty years after defeating the Galactic Empire, Han Solo and his allies face a new threat from the evil Kylo Ren and his army of Stormtroopers. |
| 5th row | Deckard Shaw seeks revenge against Dominic Toretto and his family for his comatose brother. |
Common Values
| Value | Count | Frequency (%) |
| Twenty-two years after the events of Jurassic Park, Isla Nublar now features a fully functioning dinosaur theme park, Jurassic World, as originally envisioned by John Hammond. | 1 | 0.1% |
| The Umbrella Corporation’s deadly T-virus continues to ravage the Earth, transforming the global population into legions of the flesh eating Undead. The human race’s last and only hope, Alice, awakens in the heart of Umbrella’s most clandestine operations facility and unveils more of her mysterious past as she delves further into the complex. Without a safe haven, Alice continues to hunt those responsible for the outbreak; a chase that takes her from Tokyo to New York, Washington, D.C. and Moscow, culminating in a mind-blowing revelation that will force her to rethink everything that she once thought to be true. Aided by new found allies and familiar friends, Alice must fight to survive long enough to escape a hostile world on the brink of oblivion. The countdown has begun. | 1 | 0.1% |
| A DEA agent and an undercover Naval Intelligence officer who have been tasked with investigating one another find they have been set up by the mob -- the very organization the two men believe they have been stealing money from. | 1 | 0.1% |
| Based on the classic novel by Orson Scott Card, Ender's Game is the story of the Earth's most gifted children training to defend their homeplanet in the space wars of the future. | 1 | 0.1% |
| Life for former United Nations investigator Gerry Lane and his family seems content. Suddenly, the world is plagued by a mysterious infection turning whole human populations into rampaging mindless zombies. After barely escaping the chaos, Lane is persuaded to go on a mission to investigate this disease. What follows is a perilous trek around the world where Lane must brave horrific dangers and long odds to find answers before human civilization falls. | 1 | 0.1% |
| Iconoclastic, take-no-prisoners cop John McClane, finds himself for the first time on foreign soil after traveling to Moscow to help his wayward son Jack - unaware that Jack is really a highly-trained CIA operative out to stop a nuclear weapons heist. With the Russian underworld in pursuit, and battling a countdown to war, the two McClanes discover that their opposing methods make them unstoppable heroes. | 1 | 0.1% |
| Paranormal investigators Ed and Lorraine Warren work to help a family terrorized by a dark presence in their farmhouse. Forced to confront a powerful entity, the Warrens find themselves caught in the most terrifying case of their lives. | 1 | 0.1% |
| Betrayed by his own kind and left for dead on a desolate planet, Riddick fights for survival against alien predators and becomes more powerful and dangerous than ever before. Soon bounty hunters from throughout the galaxy descend on Riddick only to find themselves pawns in his greater scheme for revenge. With his enemies right where he wants them, Riddick unleashes a vicious attack of vengeance before returning to his home planet of Furya to save it from destruction. | 1 | 0.1% |
| Gru is recruited by the Anti-Villain League to help deal with a powerful new super criminal. | 1 | 0.1% |
| In the not so distant future, Theodore, a lonely writer purchases a newly developed operating system designed to meet the user's every needs. To Theordore's surprise, a romantic relationship develops between him and his operating system. This unconventional love story blends science fiction and romance in a sweet tale that explores the nature of love and the ways that technology isolates and connects us all. | 1 | 0.1% |
| Other values (1277) | 1277 | 99.2% |
Words
| Value | Count | Frequency (%) |
| the | 3906 | 5.7% |
| a | 2776 | 4.1% |
| to | 2192 | 3.2% |
| and | 2011 | 3.0% |
| of | 1856 | 2.7% |
| in | 1198 | 1.8% |
| his | 1016 | 1.5% |
| is | 873 | 1.3% |
| with | 642 | 0.9% |
| her | 590 | 0.9% |
| Other values (11513) | 51039 | 74.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 66850 | 16.7% |
| e | 38852 | 9.7% |
| t | 26765 | 6.7% |
| a | 25985 | 6.5% |
| n | 23189 | 5.8% |
| o | 22910 | 5.7% |
| i | 22848 | 5.7% |
| r | 21357 | 5.3% |
| s | 21171 | 5.3% |
| h | 16776 | 4.2% |
| Other values (89) | 113736 | 28.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 311406 | 77.8% |
| Space Separator | 66854 | 16.7% |
| Uppercase Letter | 10726 | 2.7% |
| Other Punctuation | 8239 | 2.1% |
| Dash Punctuation | 1190 | 0.3% |
| Decimal Number | 1052 | 0.3% |
| Currency Symbol | 286 | 0.1% |
| Open Punctuation | 202 | 0.1% |
| Close Punctuation | 202 | 0.1% |
| Other Symbol | 161 | < 0.1% |
| Other values (4) | 121 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 38852 | 12.5% |
| t | 26765 | 8.6% |
| a | 25985 | 8.3% |
| n | 23189 | 7.4% |
| o | 22910 | 7.4% |
| i | 22848 | 7.3% |
| r | 21357 | 6.9% |
| s | 21171 | 6.8% |
| h | 16776 | 5.4% |
| l | 13212 | 4.2% |
| Other values (18) | 78341 | 25.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1179 | 11.0% |
| B | 856 | 8.0% |
| T | 831 | 7.7% |
| S | 788 | 7.3% |
| W | 620 | 5.8% |
| C | 618 | 5.8% |
| M | 618 | 5.8% |
| H | 500 | 4.7% |
| D | 480 | 4.5% |
| J | 468 | 4.4% |
| Other values (18) | 3768 | 35.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3696 | 44.9% |
| . | 3153 | 38.3% |
| ' | 932 | 11.3% |
| " | 231 | 2.8% |
| : | 85 | 1.0% |
| ; | 53 | 0.6% |
| ? | 49 | 0.6% |
| ! | 18 | 0.2% |
| / | 9 | 0.1% |
| & | 6 | 0.1% |
| Other values (5) | 7 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 240 | 22.8% |
| 1 | 221 | 21.0% |
| 9 | 143 | 13.6% |
| 2 | 120 | 11.4% |
| 5 | 65 | 6.2% |
| 8 | 63 | 6.0% |
| 7 | 59 | 5.6% |
| 4 | 51 | 4.8% |
| 3 | 50 | 4.8% |
| 6 | 40 | 3.8% |
Other Symbol
| Value | Count | Frequency (%) |
| ™ | 121 | 75.2% |
| © | 21 | 13.0% |
| ¦ | 17 | 10.6% |
| ® | 2 | 1.2% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ˜ | 11 | 84.6% |
| ¯ | 1 | 7.7% |
| ´ | 1 | 7.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 66850 | > 99.9% |
| (non-ASCII space) | 4 | < 0.1% |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 277 | 96.9% |
| $ | 9 | 3.1% |
Other Number
| Value | Count | Frequency (%) |
| ¹ | 1 | 50.0% |
| ³ | 1 | 50.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1190 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 202 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 202 | 100.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 81 | 100.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 25 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 322132 | 80.4% |
| Common | 78307 | 19.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 38852 | 12.1% |
| t | 26765 | 8.3% |
| a | 25985 | 8.1% |
| n | 23189 | 7.2% |
| o | 22910 | 7.1% |
| i | 22848 | 7.1% |
| r | 21357 | 6.6% |
| s | 21171 | 6.6% |
| h | 16776 | 5.2% |
| l | 13212 | 4.1% |
| Other values (46) | 89067 | 27.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 66850 | 85.4% |
| , | 3696 | 4.7% |
| . | 3153 | 4.0% |
| - | 1190 | 1.5% |
| ' | 932 | 1.2% |
| € | 277 | 0.4% |
| 0 | 240 | 0.3% |
| " | 231 | 0.3% |
| 1 | 221 | 0.3% |
| ( | 202 | 0.3% |
| Other values (33) | 1315 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 399550 | 99.8% |
| None | 372 | 0.1% |
| Currency Symbols | 277 | 0.1% |
| Letterlike Symbols | 121 | < 0.1% |
| Punctuation | 108 | < 0.1% |
| Modifier Letters | 11 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 66850 | 16.7% |
| e | 38852 | 9.7% |
| t | 26765 | 6.7% |
| a | 25985 | 6.5% |
| n | 23189 | 5.8% |
| o | 22910 | 5.7% |
| i | 22848 | 5.7% |
| r | 21357 | 5.3% |
| s | 21171 | 5.3% |
| h | 16776 | 4.2% |
| Other values (69) | 112847 | 28.2% |
None
| Value | Count | Frequency (%) |
| â | 277 | 74.5% |
| Ã | 25 | 6.7% |
| © | 21 | 5.6% |
| ¦ | 17 | 4.6% |
| œ | 10 | 2.7% |
| Â | 9 | 2.4% |
| (non-ASCII space) | 4 | 1.1% |
| ® | 2 | 0.5% |
| · | 2 | 0.5% |
| ¹ | 1 | 0.3% |
| Other values (4) | 4 | 1.1% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 277 | 100.0% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 121 | 100.0% |
Punctuation
| Value | Count | Frequency (%) |
| “ | 81 | 75.0% |
| ” | 25 | 23.1% |
| • | 2 | 1.9% |
Modifier Letters
| Value | Count | Frequency (%) |
| ˜ | 11 | 100.0% |
runtime
Real number (ℝ)
| Distinct | 102 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 110.2735 |
| Minimum | 63 |
|---|---|
| Maximum | 201 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | 63 |
|---|---|
| 5-th percentile | 87 |
| Q1 | 97 |
| median | 107 |
| Q3 | 121 |
| 95-th percentile | 145 |
| Maximum | 201 |
| Range | 138 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 18.811369 |
|---|---|
| Coefficient of variation (CV) | 0.17058829 |
| Kurtosis | 1.7660891 |
| Mean | 110.2735 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 1.0789244 |
| Sum | 141922 |
| Variance | 353.86759 |
| Monotonicity | Not monotonic |
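Several of the runtime statistics above are derived from one another. The following is a minimal sketch that recomputes them from the reported figures, assuming the usual definitions (coefficient of variation as standard deviation divided by mean, variance as the squared standard deviation); the variable names are illustrative only.

```python
# Values copied from the runtime tables above.
n_obs = 1287
total = 141922
std = 18.811369
q1, q3 = 97, 121
minimum, maximum = 63, 201

mean = total / n_obs              # arithmetic mean
cv = std / mean                   # coefficient of variation
variance = std ** 2               # squared standard deviation
iqr = q3 - q1                     # interquartile range
value_range = maximum - minimum   # range

print(f"mean={mean:.4f} cv={cv:.6f} variance={variance:.3f} "
      f"iqr={iqr} range={value_range}")
```

The recomputed values agree with the table to the precision shown, which is a quick consistency check on the extracted numbers.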
Common Values
| Value | Count | Frequency (%) |
| 100 | 44 | 3.4% |
| 90 | 39 | 3.0% |
| 102 | 34 | 2.6% |
| 97 | 34 | 2.6% |
| 109 | 33 | 2.6% |
| 108 | 33 | 2.6% |
| 106 | 33 | 2.6% |
| 98 | 30 | 2.3% |
| 95 | 30 | 2.3% |
| 107 | 30 | 2.3% |
| Other values (92) | 947 | 73.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
| 63 | 1 | 0.1% |
| 75 | 1 | 0.1% |
| 76 | 1 | 0.1% |
| 77 | 1 | 0.1% |
| 79 | 3 | 0.2% |
| 80 | 6 | 0.5% |
| 81 | 6 | 0.5% |
| 82 | 6 | 0.5% |
| 83 | 8 | 0.6% |
| 84 | 8 | 0.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 201 | 1 | 0.1% |
| 195 | 1 | 0.1% |
| 194 | 1 | 0.1% |
| 189 | 1 | 0.1% |
| 188 | 1 | 0.1% |
| 180 | 1 | 0.1% |
| 179 | 1 | 0.1% |
| 178 | 2 | 0.2% |
| 175 | 1 | 0.1% |
| 172 | 1 | 0.1% |
genres
Categorical
| Distinct | 496 |
|---|---|
| Distinct (%) | 38.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
Length
| Max length | 49 |
|---|---|
| Median length | 41 |
| Mean length | 20.586636 |
| Min length | 5 |
Characters and Unicode
| Total characters | 26495 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 327 |
|---|---|
| Unique (%) | 25.4% |
Sample
| 1st row | Action|Adventure|Science Fiction|Thriller |
|---|---|
| 2nd row | Action|Adventure|Science Fiction|Thriller |
| 3rd row | Adventure|Science Fiction|Thriller |
| 4th row | Action|Adventure|Science Fiction|Fantasy |
| 5th row | Action|Crime|Thriller |
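Each genres value packs several labels into one pipe-separated string, so the 496 distinct values are combinations rather than individual genres. The following is a minimal sketch of counting individual labels, using only the five sample rows shown above; the helper name `count_labels` is hypothetical.

```python
from collections import Counter


def count_labels(rows, sep="|"):
    """Count individual labels in a column of separator-joined strings."""
    counts = Counter()
    for row in rows:
        counts.update(label.strip() for label in row.split(sep) if label.strip())
    return counts


# The five sample rows shown above.
sample = [
    "Action|Adventure|Science Fiction|Thriller",
    "Action|Adventure|Science Fiction|Thriller",
    "Adventure|Science Fiction|Thriller",
    "Action|Adventure|Science Fiction|Fantasy",
    "Action|Crime|Thriller",
]
genre_counts = count_labels(sample)
print(genre_counts.most_common())
```

The same approach should apply to the other pipe-separated columns (keywords, production_companies), assuming they use the same separator throughout.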
Common Values
| Value | Count | Frequency (%) |
| Drama | 76 | 5.9% |
| Comedy | 69 | 5.4% |
| Drama|Romance | 37 | 2.9% |
| Comedy|Romance | 30 | 2.3% |
| Comedy|Drama|Romance | 29 | 2.3% |
| Horror|Thriller | 28 | 2.2% |
| Comedy|Drama | 23 | 1.8% |
| Adventure|Action|Thriller | 20 | 1.6% |
| Drama|Thriller | 18 | 1.4% |
| Horror | 17 | 1.3% |
| Other values (486) | 940 | 73.0% |
Words
| Value | Count | Frequency (%) |
| fiction | 112 | 7.5% |
| drama | 76 | 5.1% |
| comedy | 69 | 4.6% |
| science | 38 | 2.5% |
| drama|romance | 37 | 2.5% |
| comedy|romance | 30 | 2.0% |
| comedy|drama|romance | 29 | 1.9% |
| horror|thriller | 28 | 1.9% |
| fiction|thriller | 26 | 1.7% |
| comedy|drama | 23 | 1.5% |
| Other values (470) | 1028 | 68.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 2434 | 9.2% |
| e | 2354 | 8.9% |
| | | 2167 | 8.2% |
| i | 2087 | 7.9% |
| a | 1892 | 7.1% |
| n | 1725 | 6.5% |
| o | 1671 | 6.3% |
| m | 1624 | 6.1% |
| t | 1344 | 5.1% |
| c | 1291 | 4.9% |
| Other values (19) | 7906 | 29.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20456 | 77.2% |
| Uppercase Letter | 3663 | 13.8% |
| Math Symbol | 2167 | 8.2% |
| Space Separator | 209 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 2434 | 11.9% |
| e | 2354 | 11.5% |
| i | 2087 | 10.2% |
| a | 1892 | 9.2% |
| n | 1725 | 8.4% |
| o | 1671 | 8.2% |
| m | 1624 | 7.9% |
| t | 1344 | 6.6% |
| c | 1291 | 6.3% |
| y | 977 | 4.8% |
| Other values (7) | 3057 | 14.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 820 | 22.4% |
| C | 607 | 16.6% |
| D | 550 | 15.0% |
| F | 527 | 14.4% |
| T | 399 | 10.9% |
| S | 209 | 5.7% |
| R | 196 | 5.4% |
| H | 174 | 4.8% |
| M | 136 | 3.7% |
| W | 45 | 1.2% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 2167 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 209 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24119 | 91.0% |
| Common | 2376 | 9.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 2434 | 10.1% |
| e | 2354 | 9.8% |
| i | 2087 | 8.7% |
| a | 1892 | 7.8% |
| n | 1725 | 7.2% |
| o | 1671 | 6.9% |
| m | 1624 | 6.7% |
| t | 1344 | 5.6% |
| c | 1291 | 5.4% |
| y | 977 | 4.1% |
| Other values (17) | 6720 | 27.9% |
Common
| Value | Count | Frequency (%) |
| | | 2167 | 91.2% |
| (space) | 209 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26495 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 2434 | 9.2% |
| e | 2354 | 8.9% |
| | | 2167 | 8.2% |
| i | 2087 | 7.9% |
| a | 1892 | 7.1% |
| n | 1725 | 6.5% |
| o | 1671 | 6.3% |
| m | 1624 | 6.1% |
| t | 1344 | 5.1% |
| c | 1291 | 4.9% |
| Other values (19) | 7906 | 29.8% |
production_companies
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 1138 |
|---|---|
| Distinct (%) | 88.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
Length
| Max length | 172 |
|---|---|
| Median length | 104 |
| Mean length | 60.480186 |
| Min length | 3 |
Characters and Unicode
| Total characters | 77838 |
|---|---|
| Distinct characters | 84 |
| Distinct categories | 14 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 1081 |
|---|---|
| Unique (%) | 84.0% |
Sample
| 1st row | Universal Studios|Amblin Entertainment|Legendary Pictures|Fuji Television Network|Dentsu |
|---|---|
| 2nd row | Village Roadshow Pictures|Kennedy Miller Productions |
| 3rd row | Summit Entertainment|Mandeville Films|Red Wagon Entertainment|NeoReel |
| 4th row | Lucasfilm|Truenorth Productions|Bad Robot |
| 5th row | Universal Pictures|Original Film|Media Rights Capital|Dentsu|One Race Films |
Common Values
| Value | Count | Frequency (%) |
| Walt Disney Pictures|Pixar Animation Studios | 12 | 0.9% |
| DreamWorks Animation | 10 | 0.8% |
| Eon Productions | 9 | 0.7% |
| Marvel Studios | 8 | 0.6% |
| Paramount Pictures | 7 | 0.5% |
| New Line Cinema | 7 | 0.5% |
| Universal Pictures | 7 | 0.5% |
| Columbia Pictures | 6 | 0.5% |
| Walt Disney Pictures|Walt Disney Animation Studios | 6 | 0.5% |
| Eon Productions|Metro-Goldwyn-Mayer (MGM) | 6 | 0.5% |
| Other values (1128) | 1209 |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| productions | 257 | 3.6% |
| pictures | 208 | 2.9% |
| films | 191 | 2.7% |
| entertainment | 183 | 2.6% |
| film | 166 | 2.3% |
| universal | 109 | 1.5% |
| columbia | 87 | 1.2% |
| fox | 86 | 1.2% |
| disney | 78 | 1.1% |
| paramount | 72 | 1.0% |
| Other values (2837) | 5653 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 6092 | 7.8% |
| e | 6077 | 7.8% |
| (space) | 5803 | 7.5% |
| n | 5689 | 7.3% |
| t | 5646 | 7.3% |
| r | 5055 | 6.5% |
| a | 4380 | 5.6% |
| o | 4275 | 5.5% |
| s | 3740 | 4.8% |
| \| | 2763 | 3.5% |
| Other values (74) | 28318 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 57656 | |
| Uppercase Letter | 10671 | 13.7% |
| Space Separator | 5803 | 7.5% |
| Math Symbol | 2789 | 3.6% |
| Other Punctuation | 318 | 0.4% |
| Decimal Number | 287 | 0.4% |
| Dash Punctuation | 110 | 0.1% |
| Close Punctuation | 85 | 0.1% |
| Open Punctuation | 85 | 0.1% |
| Other Symbol | 30 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1832 | |
| F | 1130 | 10.6% |
| C | 902 | 8.5% |
| E | 785 | 7.4% |
| M | 700 | 6.6% |
| S | 669 | 6.3% |
| W | 470 | 4.4% |
| B | 468 | 4.4% |
| D | 457 | 4.3% |
| A | 392 | 3.7% |
| Other values (17) | 2866 |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 6092 | |
| e | 6077 | |
| n | 5689 | |
| t | 5646 | |
| r | 5055 | |
| a | 4380 | 7.6% |
| o | 4275 | 7.4% |
| s | 3740 | 6.5% |
| u | 2721 | 4.7% |
| l | 2583 | 4.5% |
| Other values (16) | 11398 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 67 | |
| 2 | 66 | |
| 1 | 40 | |
| 4 | 30 | |
| 3 | 26 | 9.1% |
| 9 | 19 | 6.6% |
| 6 | 16 | 5.6% |
| 8 | 11 | 3.8% |
| 7 | 9 | 3.1% |
| 5 | 3 | 1.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 205 | |
| / | 46 | 14.5% |
| & | 26 | 8.2% |
| ' | 17 | 5.3% |
| , | 17 | 5.3% |
| ‰ | 2 | 0.6% |
| : | 2 | 0.6% |
| " | 2 | 0.6% |
| ¶ | 1 | 0.3% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 2763 | |
| + | 24 | 0.9% |
| ± | 2 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5803 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 110 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 85 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 85 |
Other Symbol
| Value | Count | Frequency (%) |
| © | 30 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 1 |
Format
| Value | Count | Frequency (%) |
| | 1 |
Other Number
| Value | Count | Frequency (%) |
| ³ | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ¯ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 68327 | |
| Common | 9511 | 12.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 6092 | 8.9% |
| e | 6077 | 8.9% |
| n | 5689 | 8.3% |
| t | 5646 | 8.3% |
| r | 5055 | 7.4% |
| a | 4380 | 6.4% |
| o | 4275 | 6.3% |
| s | 3740 | 5.5% |
| u | 2721 | 4.0% |
| l | 2583 | 3.8% |
| Other values (43) | 22069 |
Common
| Value | Count | Frequency (%) |
| (space) | 5803 | |
| \| | 2763 | |
| . | 205 | 2.2% |
| - | 110 | 1.2% |
| ) | 85 | 0.9% |
| ( | 85 | 0.9% |
| 0 | 67 | 0.7% |
| 2 | 66 | 0.7% |
| / | 46 | 0.5% |
| 1 | 40 | 0.4% |
| Other values (21) | 241 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 77760 | |
| None | 76 | 0.1% |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 6092 | 7.8% |
| e | 6077 | 7.8% |
| (space) | 5803 | 7.5% |
| n | 5689 | 7.3% |
| t | 5646 | 7.3% |
| r | 5055 | 6.5% |
| a | 4380 | 5.6% |
| o | 4275 | 5.5% |
| s | 3740 | 4.8% |
| \| | 2763 | 3.6% |
| Other values (65) | 28240 |
None
| Value | Count | Frequency (%) |
| Ã | 39 | |
| © | 30 | |
| ± | 2 | 2.6% |
| ¤ | 1 | 1.3% |
| | 1 | 1.3% |
| ¶ | 1 | 1.3% |
| ³ | 1 | 1.3% |
| ¯ | 1 | 1.3% |
Punctuation
| Value | Count | Frequency (%) |
| ‰ | 2 |
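The sample values show several company names joined by `|` in a single cell, which helps explain why 1138 of the 1287 values are distinct. A minimal sketch of counting individual companies instead, assuming each cell is split on `|` first. The `rows` list is copied from the five sample rows above, and `company_counts` is an illustrative name, not something the report defines.

```python
from collections import Counter

# The five sample rows shown for production_companies.
rows = [
    "Universal Studios|Amblin Entertainment|Legendary Pictures|Fuji Television Network|Dentsu",
    "Village Roadshow Pictures|Kennedy Miller Productions",
    "Summit Entertainment|Mandeville Films|Red Wagon Entertainment|NeoReel",
    "Lucasfilm|Truenorth Productions|Bad Robot",
    "Universal Pictures|Original Film|Media Rights Capital|Dentsu|One Race Films",
]

# Split each cell on the pipe delimiter and count individual companies.
company_counts = Counter(name for row in rows for name in row.split("|"))

for name, count in company_counts.most_common(3):
    print(name, count)
```

Counting per company rather than per combined string would likely give a much lower cardinality than the one reported here.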
release_date
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 1080 |
|---|---|
| Distinct (%) | 83.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.2 KiB |
| 2011-09-30 | 5 |
|---|---|
| 2014-12-25 | 5 |
| 2011-09-16 | 4 |
| 2009-03-19 | 4 |
| 2007-09-06 | 4 |
| Other values (1075) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 12870 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 906 |
|---|---|
| Unique (%) | 70.4% |
Sample
| 1st row | 2015-06-09 |
|---|---|
| 2nd row | 2015-05-13 |
| 3rd row | 2015-03-18 |
| 4th row | 2015-12-15 |
| 5th row | 2015-04-01 |
Common Values
| Value | Count | Frequency (%) |
| 2011-09-30 | 5 | 0.4% |
| 2014-12-25 | 5 | 0.4% |
| 2011-09-16 | 4 | 0.3% |
| 2009-03-19 | 4 | 0.3% |
| 2007-09-06 | 4 | 0.3% |
| 2011-04-08 | 3 | 0.2% |
| 2010-09-11 | 3 | 0.2% |
| 2012-03-12 | 3 | 0.2% |
| 2012-09-07 | 3 | 0.2% |
| 2015-11-20 | 3 | 0.2% |
| Other values (1070) | 1250 |
Length (histogram omitted)
Words
| Value | Count | Frequency (%) |
| 2011-09-30 | 5 | 0.4% |
| 2014-12-25 | 5 | 0.4% |
| 2011-09-16 | 4 | 0.3% |
| 2009-03-19 | 4 | 0.3% |
| 2007-09-06 | 4 | 0.3% |
| 2005-09-16 | 3 | 0.2% |
| 2011-09-22 | 3 | 0.2% |
| 2009-10-10 | 3 | 0.2% |
| 2012-01-19 | 3 | 0.2% |
| 2015-12-25 | 3 | 0.2% |
| Other values (1070) | 1250 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3410 | |
| - | 2574 | |
| 1 | 2092 | |
| 2 | 2012 | |
| 9 | 640 | 5.0% |
| 5 | 399 | 3.1% |
| 3 | 386 | 3.0% |
| 8 | 355 | 2.8% |
| 7 | 341 | 2.6% |
| 4 | 331 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10296 | |
| Dash Punctuation | 2574 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3410 | |
| 1 | 2092 | |
| 2 | 2012 | |
| 9 | 640 | 6.2% |
| 5 | 399 | 3.9% |
| 3 | 386 | 3.7% |
| 8 | 355 | 3.4% |
| 7 | 341 | 3.3% |
| 4 | 331 | 3.2% |
| 6 | 330 | 3.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2574 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 12870 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3410 | |
| - | 2574 | |
| 1 | 2092 | |
| 2 | 2012 | |
| 9 | 640 | 5.0% |
| 5 | 399 | 3.1% |
| 3 | 386 | 3.0% |
| 8 | 355 | 2.8% |
| 7 | 341 | 2.6% |
| 4 | 331 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12870 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3410 | |
| - | 2574 | |
| 1 | 2092 | |
| 2 | 2012 | |
| 9 | 640 | 5.0% |
| 5 | 399 | 3.1% |
| 3 | 386 | 3.0% |
| 8 | 355 | 2.8% |
| 7 | 341 | 2.6% |
| 4 | 331 | 2.6% |
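The last-rows sample in this report pairs release_date values such as 2067-06-12 and 2063-10-11 with release_year values of 1967 and 1963, which suggests that two-digit years were expanded into the wrong century somewhere upstream. A minimal sketch of one possible repair, assuming release_year is the trusted field. `fix_century` is a hypothetical helper, not part of the report or of any library.

```python
from datetime import date

def fix_century(release_date: str, release_year: int) -> str:
    """Move a date back 100 years when its year overshoots release_year by a century."""
    parsed = date.fromisoformat(release_date)
    if parsed.year - release_year == 100:
        parsed = parsed.replace(year=release_year)
    return parsed.isoformat()

# (release_date, release_year) pairs copied from the last-rows sample.
samples = [("2067-06-12", 1967), ("2063-10-11", 1963), ("1993-07-23", 1993)]
fixed = [fix_century(d, y) for d, y in samples]
print(fixed)
```

Because release_date is stored as text, the report profiles it as Categorical; parsing it into real dates after a repair like this would allow it to be treated as a date column.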
vote_count
Real number (ℝ)
| Distinct | 894 |
|---|---|
| Distinct (%) | 69.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 947.26651 |
| Minimum | 10 |
|---|---|
| Maximum | 9767 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 42 |
| Q1 | 179 |
| median | 439 |
| Q3 | 1173 |
| 95-th percentile | 3557.5 |
| Maximum | 9767 |
| Range | 9757 |
| Interquartile range (IQR) | 994 |
Descriptive statistics
| Standard deviation | 1255.4762 |
|---|---|
| Coefficient of variation (CV) | 1.3253675 |
| Kurtosis | 8.6303039 |
| Mean | 947.26651 |
| Median Absolute Deviation (MAD) | 341 |
| Skewness | 2.580723 |
| Sum | 1219132 |
| Variance | 1576220.5 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 130 | 7 | 0.5% |
| 63 | 6 | 0.5% |
| 78 | 6 | 0.5% |
| 205 | 6 | 0.5% |
| 423 | 6 | 0.5% |
| 12 | 5 | 0.4% |
| 51 | 5 | 0.4% |
| 58 | 5 | 0.4% |
| 96 | 5 | 0.4% |
| 151 | 5 | 0.4% |
| Other values (884) | 1231 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 10 | 1 | 0.1% |
| 11 | 2 | 0.2% |
| 12 | 5 | |
| 13 | 1 | 0.1% |
| 14 | 1 | 0.1% |
| 15 | 1 | 0.1% |
| 16 | 4 | |
| 18 | 3 | |
| 19 | 2 | 0.2% |
| 20 | 1 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
| 9767 | 1 | |
| 8903 | 1 | |
| 8458 | 1 | |
| 8432 | 1 | |
| 7375 | 1 | |
| 7080 | 1 | |
| 6882 | 1 | |
| 6723 | 1 | |
| 6498 | 1 | |
| 6417 | 1 |
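Several of the vote_count statistics above are derived from others, so they can be cross-checked by hand. A small sketch using only numbers copied from the tables, with the usual textbook definitions (range = max - min, IQR = Q3 - Q1, CV = standard deviation / mean, variance = standard deviation squared, sum = mean × number of observations):

```python
# Values copied from the vote_count tables and the dataset overview.
n, mean, std = 1287, 947.26651, 1255.4762
minimum, q1, q3, maximum = 10, 179, 1173, 9767

value_range = maximum - minimum   # reported Range: 9757
iqr = q3 - q1                     # reported IQR: 994
cv = std / mean                   # reported CV: 1.3253675
variance = std ** 2               # reported Variance: 1576220.5
total = mean * n                  # reported Sum: 1219132

print(value_range, iqr, round(cv, 4), round(variance), round(total))
```

A mean (947.3) well above the median (439) and a CV above 1 are both consistent with the strong right skew reported (skewness 2.58).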
vote_average
Real number (ℝ)
| Distinct | 48 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.2794872 |
| Minimum | 2.2 |
|---|---|
| Maximum | 8.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | 2.2 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 5.8 |
| median | 6.3 |
| Q3 | 6.8 |
| 95-th percentile | 7.6 |
| Maximum | 8.3 |
| Range | 6.1 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.7959552 |
|---|---|
| Coefficient of variation (CV) | 0.12675481 |
| Kurtosis | 0.53935859 |
| Mean | 6.2794872 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | -0.3003751 |
| Sum | 8081.7 |
| Variance | 0.63354468 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 6.5 | 69 | 5.4% |
| 6.3 | 65 | 5.1% |
| 5.9 | 65 | 5.1% |
| 6 | 62 | 4.8% |
| 6.1 | 62 | 4.8% |
| 6.2 | 62 | 4.8% |
| 6.6 | 61 | 4.7% |
| 6.9 | 59 | 4.6% |
| 5.8 | 58 | 4.5% |
| 6.4 | 58 | 4.5% |
| Other values (38) | 666 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 2.2 | 1 | 0.1% |
| 3.3 | 1 | 0.1% |
| 3.4 | 1 | 0.1% |
| 3.8 | 4 | |
| 3.9 | 2 | 0.2% |
| 4 | 1 | 0.1% |
| 4.2 | 3 | |
| 4.3 | 2 | 0.2% |
| 4.4 | 7 | |
| 4.5 | 5 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 8.3 | 1 | 0.1% |
| 8.2 | 1 | 0.1% |
| 8.1 | 3 | 0.2% |
| 8 | 8 | 0.6% |
| 7.9 | 9 | 0.7% |
| 7.8 | 14 | |
| 7.7 | 14 | |
| 7.6 | 25 | |
| 7.5 | 20 | |
| 7.4 | 17 |
release_year
Real number (ℝ)
| Distinct | 51 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2007.0171 |
| Minimum | 1961 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | 1961 |
|---|---|
| 5-th percentile | 1991 |
| Q1 | 2005 |
| median | 2009 |
| Q3 | 2011 |
| 95-th percentile | 2015 |
| Maximum | 2015 |
| Range | 54 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 8.0605033 |
|---|---|
| Coefficient of variation (CV) | 0.0040161608 |
| Kurtosis | 8.1056485 |
| Mean | 2007.0171 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -2.5419022 |
| Sum | 2583031 |
| Variance | 64.971714 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 2011 | 156 | |
| 2010 | 132 | |
| 2009 | 116 | 9.0% |
| 2015 | 93 | 7.2% |
| 2007 | 92 | 7.1% |
| 2012 | 88 | 6.8% |
| 2008 | 82 | 6.4% |
| 2014 | 70 | 5.4% |
| 2006 | 68 | 5.3% |
| 2013 | 65 | 5.1% |
| Other values (41) | 325 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 1961 | 1 | 0.1% |
| 1962 | 1 | 0.1% |
| 1963 | 1 | 0.1% |
| 1964 | 2 | |
| 1965 | 1 | 0.1% |
| 1967 | 1 | 0.1% |
| 1969 | 1 | 0.1% |
| 1971 | 4 | |
| 1972 | 1 | 0.1% |
| 1973 | 2 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2015 | 93 | |
| 2014 | 70 | |
| 2013 | 65 | |
| 2012 | 88 | |
| 2011 | 156 | |
| 2010 | 132 | |
| 2009 | 116 | |
| 2008 | 82 | |
| 2007 | 92 | |
| 2006 | 68 |
budget_adj
Real number (ℝ)
| Distinct | 835 |
|---|---|
| Distinct (%) | 64.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54629936 |
| Minimum | 0.96939804 |
|---|---|
| Maximum | 4.25 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | 0.96939804 |
|---|---|
| 5-th percentile | 2253188.3 |
| Q1 | 15191800 |
| median | 35569267 |
| Q3 | 76301250 |
| 95-th percentile | 1.6803223 × 10⁸ |
| Maximum | 4.25 × 10⁸ |
| Range | 4.25 × 10⁸ |
| Interquartile range (IQR) | 61109451 |
Descriptive statistics
| Standard deviation | 55254628 |
|---|---|
| Coefficient of variation (CV) | 1.011435 |
| Kurtosis | 3.7592508 |
| Mean | 54629936 |
| Median Absolute Deviation (MAD) | 25449272 |
| Skewness | 1.7152717 |
| Sum | 7.0308727 × 10¹⁰ |
| Variance | 3.0530739 × 10¹⁵ |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 38775921.7 | 9 | 0.7% |
| 20328008.68 | 8 | 0.6% |
| 20000000 | 8 | 0.6% |
| 40656017.36 | 8 | 0.6% |
| 29081941.28 | 8 | 0.6% |
| 48469902.13 | 7 | 0.5% |
| 24234951.06 | 7 | 0.5% |
| 26291714.57 | 6 | 0.5% |
| 21033371.65 | 6 | 0.5% |
| 60767198.03 | 6 | 0.5% |
| Other values (825) | 1214 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 0.9693980426 | 1 | |
| 3 | 1 | |
| 50.06695621 | 1 | |
| 76.23003256 | 1 | |
| 82.43377477 | 1 | |
| 90.15401796 | 1 | |
| 7755.184341 | 1 | |
| 8081.117799 | 1 | |
| 15775.02874 | 1 | |
| 16479.76672 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 425000000 | 1 | |
| 368371256.2 | 1 | |
| 315500574.8 | 1 | |
| 271692064.2 | 1 | |
| 271330494.3 | 1 | |
| 260000000 | 1 | |
| 257599886.7 | 1 | |
| 254100108.5 | 1 | |
| 250000000 | 1 | |
| 246933513.2 | 1 |
revenue_adj
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 1287 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.991775 × 10⁸ |
| Minimum | 43 |
|---|---|
| Maximum | 2.8271238 × 10⁹ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | 43 |
|---|---|
| 5-th percentile | 712674.93 |
| Q1 | 27648902 |
| median | 86747696 |
| Q3 | 2.3511781 × 10⁸ |
| 95-th percentile | 7.8477998 × 10⁸ |
| Maximum | 2.8271238 × 10⁹ |
| Range | 2.8271237 × 10⁹ |
| Interquartile range (IQR) | 2.074689 × 10⁸ |
Descriptive statistics
| Standard deviation | 2.9685146 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 1.4903865 |
| Kurtosis | 17.129028 |
| Mean | 1.991775 × 10⁸ |
| Median Absolute Deviation (MAD) | 75738109 |
| Skewness | 3.3456612 |
| Sum | 2.5634144 × 10¹¹ |
| Variance | 8.8120791 × 10¹⁶ |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 1392445893 | 1 | 0.1% |
| 228089879.1 | 1 | 0.1% |
| 123500625.3 | 1 | 0.1% |
| 117506997.8 | 1 | 0.1% |
| 497843379.2 | 1 | 0.1% |
| 285166475.4 | 1 | 0.1% |
| 297658738.2 | 1 | 0.1% |
| 92046987.94 | 1 | 0.1% |
| 908665501.9 | 1 | 0.1% |
| 44322350.23 | 1 | 0.1% |
| Other values (1277) | 1277 |
Minimum 10 values
| Value | Count | Frequency (%) |
| 43 | 1 | |
| 48.3767548 | 1 | |
| 136.1976582 | 1 | |
| 233.966449 | 1 | |
| 333.7797081 | 1 | |
| 1335.830503 | 1 | |
| 7425.821572 | 1 | |
| 13881.76323 | 1 | |
| 18393.79866 | 1 | |
| 29706.10748 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2827123750 | 1 | |
| 2789712242 | 1 | |
| 2506405735 | 1 | |
| 2167324901 | 1 | |
| 1907005842 | 1 | |
| 1902723130 | 1 | |
| 1791694309 | 1 | |
| 1443191435 | 1 | |
| 1424626188 | 1 | |
| 1392445893 | 1 |
profit
Real number (ℝ)
| Distinct | 1283 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.2424095 × 10⁸ |
| Minimum | -4.1391243 × 10⁸ |
|---|---|
| Maximum | 2.5445058 × 10⁹ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 265 |
| Negative (%) | 20.6% |
| Memory size | 10.2 KiB |
Quantile statistics
| Minimum | -4.1391243 × 10⁸ |
|---|---|
| 5-th percentile | -15487781 |
| Q1 | 3142641 |
| median | 45243000 |
| Q3 | 1.4700697 × 10⁸ |
| 95-th percentile | 5.6667045 × 10⁸ |
| Maximum | 2.5445058 × 10⁹ |
| Range | 2.9584183 × 10⁹ |
| Interquartile range (IQR) | 1.4386433 × 10⁸ |
Descriptive statistics
| Standard deviation | 2.183462 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 1.7574415 |
| Kurtosis | 20.909727 |
| Mean | 1.2424095 × 10⁸ |
| Median Absolute Deviation (MAD) | 49159401 |
| Skewness | 3.5496889 |
| Sum | 1.598981 × 10¹¹ |
| Variance | 4.7675063 × 10¹⁶ |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
| 102000000 | 2 | 0.2% |
| 44000000 | 2 | 0.2% |
| 2000000 | 2 | 0.2% |
| -14000000 | 2 | 0.2% |
| 1363528810 | 1 | 0.1% |
| 24351251 | 1 | 0.1% |
| 212654182 | 1 | 0.1% |
| 305000141 | 1 | 0.1% |
| 60337295 | 1 | 0.1% |
| 894761885 | 1 | 0.1% |
| Other values (1273) | 1273 |
Minimum 10 values
| Value | Count | Frequency (%) |
| -413912431 | 1 | |
| -165710090 | 1 | |
| -111007242 | 1 | |
| -84540684 | 1 | |
| -74010360 | 1 | |
| -71331093 | 1 | |
| -68351500 | 1 | |
| -64926294 | 1 | |
| -61900000 | 1 | |
| -61403089 | 1 |
Maximum 10 values
| Value | Count | Frequency (%) |
| 2544505847 | 1 | |
| 1868178225 | 1 | |
| 1645034188 | 1 | |
| 1363528810 | 1 | |
| 1316249360 | 1 | |
| 1299557910 | 1 | |
| 1202817822 | 1 | |
| 1125035767 | 1 | |
| 1124219009 | 1 | |
| 1082730962 | 1 |
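The report does not define profit, but the sample rows suggest it is the unadjusted revenue minus the unadjusted budget: for Jurassic World, 1.513529e+09 minus 150000000 gives the listed 1.363529e+09. A quick check under that assumption, using values copied from the sample rows. The displayed figures are rounded, so a small relative tolerance is used, and `profit` here is an illustrative function rather than the code that produced the column.

```python
# (title, budget, revenue, reported profit) copied from the sample rows.
rows = [
    ("Jurassic World", 150_000_000.0, 1.513529e9, 1.363529e9),
    ("Mad Max: Fury Road", 150_000_000.0, 3.784364e8, 2.284364e8),
    ("Big Trouble in Little China", 25_000_000.0, 11_000_000.0, -14_000_000.0),
]

def profit(budget: float, revenue: float) -> float:
    """Assumed definition: unadjusted revenue minus unadjusted budget."""
    return revenue - budget

for title, budget, revenue, reported in rows:
    computed = profit(budget, revenue)
    # Allow for rounding in the displayed figures.
    matches = abs(computed - reported) <= 1e-6 * max(abs(reported), 1.0)
    print(f"{title}: computed={computed:.0f} matches={matches}")
```

Under this definition a film whose revenue is below its budget has negative profit, which fits the 265 negative values (20.6%) reported above.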
popularity_level
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Memory size | 10.2 KiB |
| High | |
|---|---|
| Medium | |
| Moderately High | |
| Low |
Length
| Max length | 15 |
|---|---|
| Median length | 6 |
| Mean length | 6.9968896 |
| Min length | 3 |
Characters and Unicode
| Total characters | 8998 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | High |
|---|---|
| 2nd row | High |
| 3rd row | High |
| 4th row | High |
| 5th row | High |
Common Values
| Value | Count | Frequency (%) |
| High | 322 | |
| Medium | 322 | |
| Moderately High | 321 | |
| Low | 321 | |
| (Missing) | 1 | 0.1% |
Length (histogram omitted)
Common Values (Plot) (chart omitted)
Words
| Value | Count | Frequency (%) |
| high | 643 | |
| medium | 322 | |
| moderately | 321 | |
| low | 321 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 965 | 10.7% |
| e | 964 | 10.7% |
| H | 643 | 7.1% |
| g | 643 | 7.1% |
| h | 643 | 7.1% |
| M | 643 | 7.1% |
| d | 643 | 7.1% |
| o | 642 | 7.1% |
| m | 322 | 3.6% |
| u | 322 | 3.6% |
| Other values (8) | 2568 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7070 | |
| Uppercase Letter | 1607 | 17.9% |
| Space Separator | 321 | 3.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 965 | |
| e | 964 | |
| g | 643 | |
| h | 643 | |
| d | 643 | |
| o | 642 | |
| m | 322 | 4.6% |
| u | 322 | 4.6% |
| r | 321 | 4.5% |
| a | 321 | 4.5% |
| Other values (4) | 1284 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 643 | |
| M | 643 | |
| L | 321 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 321 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8677 | |
| Common | 321 | 3.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 965 | |
| e | 964 | |
| H | 643 | 7.4% |
| g | 643 | 7.4% |
| h | 643 | 7.4% |
| M | 643 | 7.4% |
| d | 643 | 7.4% |
| o | 642 | 7.4% |
| m | 322 | 3.7% |
| u | 322 | 3.7% |
| Other values (7) | 2247 |
Common
| Value | Count | Frequency (%) |
| (space) | 321 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8998 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 965 | 10.7% |
| e | 964 | 10.7% |
| H | 643 | 7.1% |
| g | 643 | 7.1% |
| h | 643 | 7.1% |
| M | 643 | 7.1% |
| d | 643 | 7.1% |
| o | 642 | 7.1% |
| m | 322 | 3.6% |
| u | 322 | 3.6% |
| Other values (8) | 2568 |
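Four nearly equal groups (322, 322, 321, 321) plus a single missing value are consistent with binning popularity at its quartiles using right-closed intervals, where the minimum value falls outside the first bin. This is only one plausible explanation; the report does not say how popularity_level was built. A minimal sketch of that effect, using made-up popularity values and a hypothetical `assign_level` helper:

```python
from bisect import bisect_left
from statistics import quantiles

LABELS = ["Low", "Medium", "Moderately High", "High"]

def assign_level(value, edges, include_lowest=False):
    """Label a value using right-closed bins (edges[i], edges[i+1]]."""
    if value == edges[0]:
        # The lowest edge is excluded unless explicitly included.
        return LABELS[0] if include_lowest else None
    if not edges[0] < value <= edges[-1]:
        return None
    # bisect_left finds the first edge >= value; the bin is the one before it.
    return LABELS[bisect_left(edges, value) - 1]

# Made-up popularity scores, for illustration only.
popularity = [0.2, 0.5, 0.9, 1.4, 2.0, 3.1, 4.8, 9.3]
q1, q2, q3 = quantiles(popularity, n=4, method="inclusive")
edges = [min(popularity), q1, q2, q3, max(popularity)]

levels = [assign_level(p, edges) for p in popularity]
fixed = [assign_level(p, edges, include_lowest=True) for p in popularity]
print(levels.count(None), fixed.count(None))
```

If the column was produced with something like `pandas.cut` and its default `include_lowest=False`, passing `include_lowest=True` would be the equivalent fix for the one missing cell.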
Correlations
| | Unnamed: 0 | id | popularity | budget | revenue | runtime | vote_count | vote_average | release_year | budget_adj | revenue_adj | profit | popularity_level |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Unnamed: 0 | 1.000 | -0.575 | -0.170 | -0.095 | -0.034 | 0.021 | -0.119 | 0.067 | -0.630 | -0.001 | 0.044 | -0.020 | 0.204 |
| id | -0.575 | 1.000 | 0.042 | -0.035 | -0.117 | -0.101 | 0.028 | -0.137 | 0.909 | -0.145 | -0.198 | -0.112 | 0.150 |
| popularity | -0.170 | 0.042 | 1.000 | 0.507 | 0.690 | 0.313 | 0.845 | 0.367 | 0.154 | 0.525 | 0.683 | 0.648 | 0.364 |
| budget | -0.095 | -0.035 | 0.507 | 1.000 | 0.743 | 0.315 | 0.578 | -0.004 | 0.099 | 0.979 | 0.699 | 0.499 | 0.286 |
| revenue | -0.034 | -0.117 | 0.690 | 0.743 | 1.000 | 0.327 | 0.788 | 0.245 | 0.008 | 0.766 | 0.985 | 0.929 | 0.330 |
| runtime | 0.021 | -0.101 | 0.313 | 0.315 | 0.327 | 1.000 | 0.335 | 0.357 | -0.036 | 0.343 | 0.337 | 0.271 | 0.189 |
| vote_count | -0.119 | 0.028 | 0.845 | 0.578 | 0.788 | 0.335 | 1.000 | 0.448 | 0.175 | 0.579 | 0.762 | 0.746 | 0.441 |
| vote_average | 0.067 | -0.137 | 0.367 | -0.004 | 0.245 | 0.357 | 0.448 | 1.000 | -0.101 | 0.029 | 0.265 | 0.330 | 0.264 |
| release_year | -0.630 | 0.909 | 0.154 | 0.099 | 0.008 | -0.036 | 0.175 | -0.101 | 1.000 | -0.033 | -0.096 | -0.007 | 0.117 |
| budget_adj | -0.001 | -0.145 | 0.525 | 0.979 | 0.766 | 0.343 | 0.579 | 0.029 | -0.033 | 1.000 | 0.751 | 0.529 | 0.284 |
| revenue_adj | 0.044 | -0.198 | 0.683 | 0.699 | 0.985 | 0.337 | 0.762 | 0.265 | -0.096 | 0.751 | 1.000 | 0.925 | 0.327 |
| profit | -0.020 | -0.112 | 0.648 | 0.499 | 0.929 | 0.271 | 0.746 | 0.330 | -0.007 | 0.529 | 0.925 | 1.000 | 0.332 |
| popularity_level | 0.204 | 0.150 | 0.364 | 0.286 | 0.330 | 0.189 | 0.441 | 0.264 | 0.117 | 0.284 | 0.327 | 0.332 | 1.000 |
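The high-correlation alerts at the top of the report can be reproduced from this matrix if a field counts as highly correlated once its absolute coefficient reaches 0.5. That threshold is an assumption that happens to match the alert counts, not something the report states. A minimal sketch using two rows copied from the matrix, where `highly_correlated` is a hypothetical helper:

```python
COLUMNS = ["Unnamed: 0", "id", "popularity", "budget", "revenue", "runtime",
           "vote_count", "vote_average", "release_year", "budget_adj",
           "revenue_adj", "profit", "popularity_level"]

# Two rows copied from the correlation matrix.
MATRIX = {
    "popularity": [-0.170, 0.042, 1.000, 0.507, 0.690, 0.313, 0.845,
                   0.367, 0.154, 0.525, 0.683, 0.648, 0.364],
    "budget": [-0.095, -0.035, 0.507, 1.000, 0.743, 0.315, 0.578,
               -0.004, 0.099, 0.979, 0.699, 0.499, 0.286],
}

def highly_correlated(field, threshold=0.5):
    """Return the other fields whose |coefficient| meets the assumed threshold."""
    return [col for col, r in zip(COLUMNS, MATRIX[field])
            if col != field and abs(r) >= threshold]

print(highly_correlated("popularity"))
print(highly_correlated("budget"))
```

With this threshold, popularity yields six fields and budget yields five, matching "budget and 5 other fields" and "popularity and 4 other fields" in the alerts. Note that budget and profit (0.499) fall just under the cut-off.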
First rows
| | Unnamed: 0 | id | imdb_id | popularity | budget | revenue | original_title | cast | homepage | director | tagline | keywords | overview | runtime | genres | production_companies | release_date | vote_count | vote_average | release_year | budget_adj | revenue_adj | profit | popularity_level |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 135397 | tt0369610 | 32.985763 | 150000000.0 | 1.513529e+09 | Jurassic World | Chris Pratt|Bryce Dallas Howard|Irrfan Khan|Vincent D'Onofrio|Nick Robinson | http://www.jurassicworld.com/ | Colin Trevorrow | The park is open. | monster|dna|tyrannosaurus rex|velociraptor|island | Twenty-two years after the events of Jurassic Park, Isla Nublar now features a fully functioning dinosaur theme park, Jurassic World, as originally envisioned by John Hammond. | 124 | Action|Adventure|Science Fiction|Thriller | Universal Studios|Amblin Entertainment|Legendary Pictures|Fuji Television Network|Dentsu | 2015-06-09 | 5562 | 6.5 | 2015 | 1.379999e+08 | 1.392446e+09 | 1.363529e+09 | High |
| 1 | 1 | 76341 | tt1392190 | 28.419936 | 150000000.0 | 3.784364e+08 | Mad Max: Fury Road | Tom Hardy|Charlize Theron|Hugh Keays-Byrne|Nicholas Hoult|Josh Helman | http://www.madmaxmovie.com/ | George Miller | What a Lovely Day. | future|chase|post-apocalyptic|dystopia|australia | An apocalyptic story set in the furthest reaches of our planet, in a stark desert landscape where humanity is broken, and most everyone is crazed fighting for the necessities of life. Within this world exist two rebels on the run who just might be able to restore order. There's Max, a man of action and a man of few words, who seeks peace of mind following the loss of his wife and child in the aftermath of the chaos. And Furiosa, a woman of action and a woman who believes her path to survival may be achieved if she can make it across the desert back to her childhood homeland. | 120 | Action|Adventure|Science Fiction|Thriller | Village Roadshow Pictures|Kennedy Miller Productions | 2015-05-13 | 6185 | 7.1 | 2015 | 1.379999e+08 | 3.481613e+08 | 2.284364e+08 | High |
| 2 | 2 | 262500 | tt2908446 | 13.112507 | 110000000.0 | 2.952382e+08 | Insurgent | Shailene Woodley|Theo James|Kate Winslet|Ansel Elgort|Miles Teller | http://www.thedivergentseries.movie/#insurgent | Robert Schwentke | One Choice Can Destroy You | based on novel|revolution|dystopia|sequel|dystopic future | Beatrice Prior must confront her inner demons and continue her fight against a powerful alliance which threatens to tear her society apart. | 119 | Adventure|Science Fiction|Thriller | Summit Entertainment|Mandeville Films|Red Wagon Entertainment|NeoReel | 2015-03-18 | 2480 | 6.3 | 2015 | 1.012000e+08 | 2.716190e+08 | 1.852382e+08 | High |
| 3 | 3 | 140607 | tt2488496 | 11.173104 | 200000000.0 | 2.068178e+09 | Star Wars: The Force Awakens | Harrison Ford|Mark Hamill|Carrie Fisher|Adam Driver|Daisy Ridley | http://www.starwars.com/films/star-wars-episode-vii | J.J. Abrams | Every generation has a story. | android|spaceship|jedi|space opera|3d | Thirty years after defeating the Galactic Empire, Han Solo and his allies face a new threat from the evil Kylo Ren and his army of Stormtroopers. | 136 | Action|Adventure|Science Fiction|Fantasy | Lucasfilm|Truenorth Productions|Bad Robot | 2015-12-15 | 5292 | 7.5 | 2015 | 1.839999e+08 | 1.902723e+09 | 1.868178e+09 | High |
| 4 | 4 | 168259 | tt2820852 | 9.335014 | 190000000.0 | 1.506249e+09 | Furious 7 | Vin Diesel|Paul Walker|Jason Statham|Michelle Rodriguez|Dwayne Johnson | http://www.furious7.com/ | James Wan | Vengeance Hits Home | car race|speed|revenge|suspense|car | Deckard Shaw seeks revenge against Dominic Toretto and his family for his comatose brother. | 137 | Action|Crime|Thriller | Universal Pictures|Original Film|Media Rights Capital|Dentsu|One Race Films | 2015-04-01 | 2947 | 7.3 | 2015 | 1.747999e+08 | 1.385749e+09 | 1.316249e+09 | High |
| 5 | 5 | 281957 | tt1663202 | 9.110700 | 135000000.0 | 5.329505e+08 | The Revenant | Leonardo DiCaprio|Tom Hardy|Will Poulter|Domhnall Gleeson|Paul Anderson | http://www.foxmovies.com/movies/the-revenant | Alejandro González Iñárritu | (n. One who has returned, as if from the dead.) | father-son relationship|rape|based on novel|mountains|winter | In the 1820s, a frontiersman, Hugh Glass, sets out on a path of vengeance against those who left him for dead after a bear mauling. | 156 | Western|Drama|Adventure|Thriller | Regency Enterprises|Appian Way|CatchPlay|Anonymous Content|New Regency Pictures | 2015-12-25 | 3929 | 7.2 | 2015 | 1.241999e+08 | 4.903142e+08 | 3.979505e+08 | High |
| 6 | 6 | 87101 | tt1340138 | 8.654359 | 155000000.0 | 4.406035e+08 | Terminator Genisys | Arnold Schwarzenegger|Jason Clarke|Emilia Clarke|Jai Courtney|J.K. Simmons | http://www.terminatormovie.com/ | Alan Taylor | Reset the future | saving the world|artificial intelligence|cyborg|killer robot|future | The year is 2029. John Connor, leader of the resistance continues the war against the machines. At the Los Angeles offensive, John's fears of the unknown future begin to emerge when TECOM spies reveal a new plot by SkyNet that will attack him from both fronts; past and future, and will ultimately change warfare forever. | 125 | Science Fiction|Action|Thriller|Adventure | Paramount Pictures|Skydance Productions | 2015-06-23 | 2598 | 5.8 | 2015 | 1.425999e+08 | 4.053551e+08 | 2.856035e+08 | High |
| 7 | 7 | 286217 | tt3659388 | 7.667400 | 108000000.0 | 5.953803e+08 | The Martian | Matt Damon|Jessica Chastain|Kristen Wiig|Jeff Daniels|Michael Peña | http://www.foxmovies.com/movies/the-martian | Ridley Scott | Bring Him Home | based on novel|mars|nasa|isolation|botanist | During a manned mission to Mars, Astronaut Mark Watney is presumed dead after a fierce storm and left behind by his crew. But Watney has survived and finds himself stranded and alone on the hostile planet. With only meager supplies, he must draw upon his ingenuity, wit and spirit to subsist and find a way to signal to Earth that he is alive. | 141 | Drama|Adventure|Science Fiction | Twentieth Century Fox Film Corporation|Scott Free Productions|Mid Atlantic Films|International Traders|TSG Entertainment | 2015-09-30 | 4572 | 7.6 | 2015 | 9.935996e+07 | 5.477497e+08 | 4.873803e+08 | High |
| 8 | 8 | 211672 | tt2293640 | 7.404165 | 74000000.0 | 1.156731e+09 | Minions | Sandra Bullock|Jon Hamm|Michael Keaton|Allison Janney|Steve Coogan | http://www.minionsmovie.com/ | Kyle Balda|Pierre Coffin | Before Gru, they had a history of bad bosses | assistant|aftercreditsstinger|duringcreditsstinger|evil mastermind|minions | Minions Stuart, Kevin and Bob are recruited by Scarlet Overkill, a super-villain who, alongside her inventor husband Herb, hatches a plot to take over the world. | 91 | Family|Animation|Adventure|Comedy | Universal Pictures|Illumination Entertainment | 2015-06-17 | 2893 | 6.5 | 2015 | 6.807997e+07 | 1.064192e+09 | 1.082731e+09 | High |
| 9 | 9 | 150540 | tt2096673 | 6.326804 | 175000000.0 | 8.537086e+08 | Inside Out | Amy Poehler|Phyllis Smith|Richard Kind|Bill Hader|Lewis Black | http://movies.disney.com/inside-out | Pete Docter | Meet the little voices inside your head. | dream|cartoon|imaginary friend|animation|kid | Growing up can be a bumpy road, and it's no exception for Riley, who is uprooted from her Midwest life when her father starts a new job in San Francisco. Like all of us, Riley is guided by her emotions - Joy, Fear, Anger, Disgust and Sadness. The emotions live in Headquarters, the control center inside Riley's mind, where they help advise her through everyday life. As Riley and her emotions struggle to adjust to a new life in San Francisco, turmoil ensues in Headquarters. Although Joy, Riley's main and most important emotion, tries to keep things positive, the emotions conflict on how best to navigate a new city, house and school. | 94 | Comedy|Animation|Family | Walt Disney Pictures|Pixar Animation Studios|Walt Disney Studios Motion Pictures | 2015-06-09 | 3935 | 8.0 | 2015 | 1.609999e+08 | 7.854116e+08 | 6.787086e+08 | High |
Last rows
| | Unnamed: 0 | id | imdb_id | popularity | budget | revenue | original_title | cast | homepage | director | tagline | keywords | overview | runtime | genres | production_companies | release_date | vote_count | vote_average | release_year | budget_adj | revenue_adj | profit | popularity_level |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1277 | 10338 | 8291 | tt0107840 | 0.313792 | 14000000.0 | 27515786.0 | Poetic Justice | Janet Jackson|Tupac Shakur|Regina King|Joe Torry|Tyra Ferrell | http://www.janetjackson.com | John Singleton | A Street Romance. | loss of lover|sadness|los angeles|road movie | In this film, we see the world through the eyes of main character Justice, a young African-American poet. A mail carrier invites a few friends along for a long overnight delivery run. | 109 | Drama|Romance | Columbia Pictures | 1993-07-23 | 24 | 6.8 | 1993 | 2.113258e+07 | 4.153425e+07 | 13515786.0 | Low |
| 1278 | 10401 | 667 | tt0062512 | 1.554808 | 9500000.0 | 111584787.0 | You Only Live Twice | Sean Connery|Akiko Wakabayashi|Karin Dor|Mie Hama|TetsurÅ Tamba | http://www.mgm.com/view/movie/2347/You-Only-Live-Twice/ | Lewis Gilbert | You Only Live Twice...and Twice is the only way to live! | london|japan|england|assassination|helicopter | A mysterious space craft kidnaps a Russian and American space capsule and brings the world on the verge of another World War. James Bond investigates the case in Japan and meets with his archenemy Blofeld. The fifth film from the legendary James Bond series starring Sean Connery as the British super agent. | 117 | Action|Thriller|Adventure | Eon Productions | 2067-06-12 | 301 | 6.2 | 1967 | 6.209926e+07 | 7.294034e+08 | 102084787.0 | Moderately High |
| 1279 | 10438 | 657 | tt0057076 | 2.508235 | 2500000.0 | 78898765.0 | From Russia With Love | Sean Connery|Daniela Bianchi|Lotte Lenya|Robert Shaw|Bernard Lee | http://www.mgm.com/view/movie/717/From-Russia-With-Love/ | Terence Young | The world's masters of murder pull out all the stops to destroy Agent 007! | venice|london|terror|england|assassination | Agent 007 is back in the second installment of the James Bond series, this time battling a secret crime organization known as SPECTRE. Russians Rosa Klebb and Kronsteen are out to snatch a decoding device known as the Lektor, using the ravishing Tatiana to lure Bond into helping them. Bond willingly travels to meet Tatiana in Istanbul, where he must rely on his wits to escape with his life in a series of deadly encounters with the enemy | 115 | Action|Thriller|Adventure | Eon Productions|Metro-Goldwyn-Mayer (MGM)|Danjaq | 2063-10-11 | 458 | 6.7 | 1963 | 1.780045e+07 | 5.617734e+08 | 76398765.0 | High |
| 1280 | 10489 | 6978 | tt0090728 | 0.960984 | 25000000.0 | 11000000.0 | Big Trouble in Little China | Kurt Russell|Kim Cattrall|Dennis Dun|James Hong|Victor Wong | http://www.theofficialjohncarpenter.com/big-trouble-in-little-china/ | John Carpenter | Adventure doesn't come any bigger! | kung fu|chinatown|magic | When trucker Jack Burton agreed to take his friend Wang Chi to pick up his fiancee at the airport, he never expected to get involved in a supernatural battle between good and evil. Wang's fiancee has emerald green eyes, which make her a perfect target for an immortal sorcerer named Lo Pan and his three invincible cronies. Lo Pan must marry a girl with green eyes so he can regain his physical form. | 99 | Adventure|Fantasy|Action|Comedy | Twentieth Century Fox Film Corporation|TAFT Entertainment Pictures | 1986-05-30 | 347 | 6.7 | 1986 | 4.973516e+07 | 2.188347e+07 | -14000000.0 | Medium |
| 1281 | 10594 | 9552 | tt0070047 | 2.010733 | 8000000.0 | 441306145.0 | The Exorcist | Linda Blair|Max von Sydow|Ellen Burstyn|Jason Miller|Lee J. Cobb | http://theexorcist.warnerbros.com/ | William Friedkin | Something almost beyond comprehension is happening to a girl on this street, in this house... and a man has been sent for as a last resort. This man is The Exorcist. | exorcism|holy water|religion and supernatural|vomit|christian | 12-year-old Regan MacNeil begins to adapt an explicit new personality as strange events befall the local area of Georgetown. Her mother becomes torn between science and superstition in a desperate bid to save her daughter, and ultimately turns to her last hope: Father Damien Karras, a troubled priest who is struggling with his own faith. | 122 | Drama|Horror|Thriller | Warner Bros.|Hoya Productions | 1973-12-26 | 1113 | 7.2 | 1973 | 3.928928e+07 | 2.167325e+09 | 433306145.0 | Moderately High |
| 1282 | 10595 | 253 | tt0070328 | 1.549139 | 7000000.0 | 161777836.0 | Live and Let Die | Roger Moore|Yaphet Kotto|Jane Seymour|Clifton James|Julius Harris | http://www.mgm.com/view/movie/1130/Live-and-Let-Die/ | Guy Hamilton | Roger Moore is James Bond. | london|new york|bomb|england|spy | James Bond must investigate a mysterious murder case of a British agent in New Orleans. Soon he finds himself up against a gangster boss named Mr. Big. | 121 | Adventure|Action|Thriller | Eon Productions|Metro-Goldwyn-Mayer (MGM) | 1973-07-05 | 293 | 6.1 | 1973 | 3.437812e+07 | 7.945168e+08 | 154777836.0 | Moderately High |
| 1283 | 10689 | 660 | tt0059800 | 1.910465 | 11000000.0 | 141195658.0 | Thunderball | Sean Connery|Claudine Auger|Adolfo Celi|Luciana Paluzzi|Rik Van Nutter | http://www.mgm.com/view/movie/2009/Thunderball/ | Terence Young | Look up! Look down! Look out! | paris|florida|fighter pilot|sanatorium|secret organization | A criminal organization has obtained two nuclear bombs and are asking for a 100 million pound ransom in the form of diamonds in seven days or they will use the weapons. The secret service sends James Bond to the Bahamas to once again save the world. | 130 | Adventure|Action|Thriller | Eon Productions|Metro-Goldwyn-Mayer (MGM) | 2065-12-16 | 331 | 6.3 | 1965 | 7.612620e+07 | 9.771535e+08 | 130195658.0 | Moderately High |
| 1284 | 10724 | 668 | tt0064757 | 1.778746 | 7000000.0 | 81974493.0 | On Her Majesty's Secret Service | George Lazenby|Diana Rigg|Telly Savalas|Gabriele Ferzetti|Ilse Steppat | http://www.mgm.com/view/movie/1411/On-Her-Majesty%E2%80%99s-Secret-Service/ | Peter R. Hunt | Far up! Far out! Far more! James Bond 007 is back! | london|suicide|england|switzerland|secret identity | James Bond tracks archnemesis Ernst Blofeld to a mountaintop retreat where he's training an army of beautiful but lethal women. Along the way, Bond falls for Italian contessa Tracy Draco -- and marries her in order to get closer to Blofeld. Meanwhile, he locates Blofeld in the Alps and embarks on a classic ski chase. | 142 | Adventure|Action|Thriller | Eon Productions|Metro-Goldwyn-Mayer (MGM)|Danjaq | 2069-12-12 | 258 | 6.4 | 1969 | 4.160985e+07 | 4.872780e+08 | 74974493.0 | Moderately High |
| 1285 | 10759 | 948 | tt0077651 | 1.198849 | 300000.0 | 70000000.0 | Halloween | Donald Pleasence|Jamie Lee Curtis|P.J. Soles|Nancy Kyes|Nick Castle | http://www.theofficialjohncarpenter.com/halloween/ | John Carpenter | The Night HE Came Home! | female nudity|nudity|mask|babysitter|halloween | A psychotic murderer, institutionalized since childhood for the murder of his sister, escapes and stalks a bookish teenage girl and her friends while his doctor chases him through the streets. | 91 | Horror|Thriller | Compass International Pictures|Falcon International Productions | 1978-10-25 | 522 | 7.3 | 1978 | 1.002810e+06 | 2.339890e+08 | 69700000.0 | Moderately High |
| 1286 | 10760 | 8469 | tt0077975 | 1.157930 | 2700000.0 | 141000000.0 | Animal House | John Belushi|Tim Matheson|John Vernon|Verna Bloom|Tom Hulce | http://www.animalhouse.com/ | John Landis | It was the Deltas against the rules... the rules lost! | female nudity|sex|nudity|collage|fraternity | At a 1962 College, Dean Vernon Wormer is determined to expel the entire Delta Tau Chi Fraternity, but those troublemakers have other plans for him. | 109 | Comedy | Universal Pictures|Oregon Film Factory|Stage III Productions | 1978-07-27 | 230 | 6.7 | 1978 | 9.025292e+06 | 4.713208e+08 | 138300000.0 | Moderately High |
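
Several rows above pair a release_date in the 2060s with a release_year in the 1960s (for example 2067-06-12 next to 1967), which suggests that two-digit years were parsed into the wrong century. The sketch below shows one possible way to reconcile the two columns. It assumes release_year is the trustworthy value, and the helper name `fix_century` is hypothetical, not part of the report or of any library.

```python
from datetime import date


def fix_century(release_date: str, release_year: int) -> str:
    """Return release_date with its year set to release_year when the two
    differ by exactly 100 years; otherwise return the date unchanged.

    Assumption: release_year is correct and release_date was shifted by a
    two-digit-year parse. Note that date.replace raises ValueError for
    29 February if the target year is not a leap year.
    """
    parsed = date.fromisoformat(release_date)
    if parsed.year - release_year == 100:
        parsed = parsed.replace(year=release_year)
    return parsed.isoformat()


# (title, release_date, release_year) triples copied from the rows above.
rows = [
    ("You Only Live Twice", "2067-06-12", 1967),
    ("Thunderball", "2065-12-16", 1965),
    ("Poetic Justice", "1993-07-23", 1993),
]

fixed = {title: fix_century(d, y) for title, d, y in rows}
for title, corrected in fixed.items():
    print(title, corrected)
```

Dates whose year already matches release_year pass through untouched, so the helper can be applied to every row without first filtering for the affected ones.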